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Gesellschaf t fur Biotechnologische Forschung mbH 



Tubulysin-Biosynthese-Gene 

Tubulysine sind bereits. als eine neue, auf das Tubulin-Skelett 
wirkende Substanzf amilie aus Myxobakterien in Irsee vorgestellt 
worden; vgl. PCT/EP 97/05095 und DE 100 08 08 9.8 und die darin 
angeftihrte Literatur. Im Gegensatz zu den Epothilonen zeigen 
diese eine mikrotubuli-abbauende Wirkung sowie die vermehrte 
Ausbildung von Zentrosomen. Mit einer Cytotoxizitat von IC 50 = 
10 - 500 pg sind die Tubulysine als potentielle Cytostatika von 
besonderem Interesse . 

Die Tubulysine haben eine cytostatische oder antimitotische 
Wirkung auf Pilze, humane Tumore oder Krebszellinien und andere 
tierische Zellkulturen (vgl/ Tabelle) . Sie fuhren in den Zellen 
zu einem raschen Abbau des Mikrotubuli-Gerusts . Das Aktinskelett 
bleibt erhalten. Adharent wachsende L929-Maus-Zellen vergrofiern 
unter dem Einfluft der Tubulysine ihr Zellvolumen, ohne sich zu 
teilen, und entwickeln groJie Zellkerne, die dann in einem 
apoptischen Vorgang zer fallen. 
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Wirkunsspektrum 



Pilze 



Aspergillus niger 
Botrytis cineria 
Coprinus cinereus 
Pythinum debaryanum 



Hemmhof [mm] 



Tubulysin A Tubulysin B 



20 
23 
20 
20 



18 
18 



Agardif f usiontest : 20 ug pro Testblattchen 6 mm Durchmesser 



Humane Krebszellinine 



KB-3-1 

(DSM ACC 158) 
K-562 

(ATCC CCL 243) 
HL-60 

(ATCC-CCL 240) 



0,1 
0,04 



IC 50 [ng/ml] 



Tubulysin A Tubulysin B Tubulysin C 
0,01 0,02 0,1 



0,2 
0, 08 



1,5 
0,4 



Tierische Zellinien 



L929, Maus 0,2 0,4 2 

(ATCC CCL1) 



Pt K2, Potorous tri- 
dactylis 
(ATCC CCL 56) 



0,2 



0,2 



2 
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Gemafi einer Ausf uhrungsf orm betrifft die Erfindung ein ss-DNA- 
Molekul, ausgewahlt aus der folgenden Gruppe: 

( i) ss-DNA-Molekul mit einer Sequenz gemali Figur 1; 

(ii) ss-DNA-Molekul, das mit einem ss-DNA-MolekUl gemaii (i) 
hinsichtlich seiner Nucleotid-Anzahl oder seiner 
Nucleotid-Sequenz zu jeweils 90, 91, 92, 93, 94, 95, 
96, 97, 98, 99 oder 100% homolog ist, jedoch von dem 
ss-DNA-Molekul gemali (i) hinsichtlich seiner 
Nucleotid-Anzahl und/oder seiner Nucleotid-Sequenz in 
mindestens eineni Nucleotid abweicht; und 

(iii) ss-DNA-Molekul mit einer Sequenz, die zur Sequenz 
eines ss-DNA-Molekuls gem£fi (i) oder (ii) komplementar 
ist . 

Ferner betrifft die Erfindung ein ds-DNA-Molekul aus einem 
erf indungsgemaften ss-DNA-Molekul und einem dazu komplementaren 
Strang . 

Gemali einer weiteren Ausf iihrungsf orm betrifft die Erfindung ein 
ss-DNA-Molekul, ausgewahlt aus der folgenden Gruppe: 

(i) ss-DNA-Molektil mit einer Sequenz der Positionen 3.308 
bis 1 (ORF 16) der Sequenz gemafi Figur 1; 

(ii) ss-DNA-Molekul mit einer Sequenz der Positionen 4706 
bis 3453 (ORF 15) der Sequenz gemali Figur 1; 



(iii) ss-DNA-Molektil mit einer Sequenz der Positionen 5719 

bis 7164 (ORF 14) der Sequenz gemali Figur 1; 



WO 2004/022586 




PCT/EP2003/009780 



(iv) ss-DNA-Molekul mit einer Sequenz der Positionen 9557 
bis 7317 (ORF 13) der Sequenz gemafi Figur 1; 

(v) ss-DNA-Molekul mit einer Sequenz der Positionen 12193 
bis 10550 (ORF 12) der Sequenz gemafi Figur 1; 

<vi) ss-DNA-Molekul mit einer Sequenz der Positionen 12841 

bis 13881 (ORF 11) der Sequenz gemafi Figur 1; 

(vii) ss-DNA-Molekul mit einer Sequenz der Positionen 14833 
bis 13835 (ORF 10) der Sequenz gemafi Figur 1; 

(viii) ss-DNA-Molekul mit einer Sequenz der Positionen 14942 
bis 15586 (ORF 9) der Sequenz gemafi Figur 1; 

(ix) ss-DNA-Molekul mit einer Sequenz der Positionen 15847 
bis 16983 (ORF 8) der Sequenz gemafi Figur 1/ 

(x) ss-DNA-Molekul mit einer Sequenz der Positionen 21154 
bis 18809 (ORF 7) der Sequenz gemafi Figur 1; 

(xi) ss-DNA-Molekul mit einer Sequenz der Positionen 22366 
bis 23532 (ORF 6) der Sequenz gemafi Figur 1; 

(xii) ss-DNA-Molekul mit einer Sequenz der Positionen 24591 
bis 26513 (ORF 5) der Sequenz gemafi Figur 1; 



(xiii) 



ss-DNA-Molekul mit einer Sequenz der Positionen 26597 
bis 27517 (ORF 4) der Sequenz gemafi Figur 1; 
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(xiv) ss-DNA-Molekul mit einer Sequenz der Positionen 29858 
bis 30400 (ORF 3) der Sequenz gem£fi Figur 1; 

(xv) ss-DNA-Molekiil mit einer Sequenz der Positionen 31220 
bis 32392 (TubA) der Sequenz gemafi Figur 1; 

(xvi) ss-DNA-Molekiil mit einer Sequenz der Positionen 33056 
bis 32397 (ORF 2) der Sequenz gemafi Figur 1; 

(xvii) ss-DNA-Molekiil mit einer Sequenz der Positionen 34195 
bis 33074 (TubZ) der Sequenz gemafi Figur 1; 

(xviii) ss-DNA-Molekiil mit einer Sequenz der Positionen 35422 
bis 34205 (ORF 1) der Sequenz gemafi Figur 1; 

(xix) ss-DNA-Molekul mit einer Sequenz der Positionen 35522 
bis 40147 (TubB) der Sequenz gemafi Figur 1; 

(xx) ss-DNA-Molekul mit einer Sequenz der Positionen 40144 
bis 48021 (TubC) der Sequenz gemafi Figur 1; 

(xxi) ss-DNA-Molekiil mit einer Sequenz der Positionen 48011 
bis 58558 (TubD) der Sequenz gemafi Figur 1; 

(xxii) ss-DNA-Molekul mit einer Sequenz der Positionen 58551 
bis 62096 (TubE) der Sequenz gemafi Figur 1; 

(xxiii) ss-DNA-Molekul mit einer Sequenz der Positionen 62103 
bis 70616 (TubF) der Sequenz gemafi Figur 1; 



(xxiv) 



ss-DNA-Molekiil, das mit einem Molekul gemafi (i), (ii), 
(iii), (iv), (v), (vi), (vii) , (viii) , (ix), (x) , 
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(xi), (xii), (xiii), (xiv) , (xv) , (xvi) , (xvii), 
(xviii), (xix), (xx), (xxi) , (xxii) oder (xxiii) unter 
stringenten Bedingungen hybridisierbar ist und 
insbesondere dieselbe Anzahl von Basen aufweist; und 

(xxv) ss-DNA-Moleklil, das mit einem ss-DNA-Molekiil gemaJi 
(i), (ii), (iii), (iv) , (v) , (vi) , (vii) , (viii) , 
(ix), (x), (xi), (xii), (xiii), (xiv), (xv) , (xvi), 
(xvii), (xviii), (xix), (xx), (xxi), (xxii) oder 
(xxiii) hinsichtlich seiner Nucleotid-Anzahl oder 
seiner Nucleotid-Sequenz zu jeweils 90, 91, 92, 93, 
94, 95, 96, 97, 98, 99 oder 100% homolog ist, jedoch 
von diesem ss-DNA-Molekiil hinsichtlich seiner 
Nucleotid-Anzahl und/oder seiner Nucleotid-Sequenz in 
mindestens einem Nucleotid abweicht; und 

(xxvi) ss-DNA-Molekiil mit einer Sequenz, die zur Sequenz 
eines Molekuls gemaft (i) , (ii) , (iii) , (iv) , (v) , 
(vi) , (vii), (viii), (ix), (x), (xi), (xii), (xiii) , 
(xiv), (xv), (xvi), (xvii), (xviii), (xix), (xx) , 
(xxi), (xxii), (xxiii), (xxiv) oder (xxv) komplementar 
ist . 

Ferner betrifft die Erfindung ein ds-DNA-Molekiil aus einem 
derartigen erf indungsgemaiien ss-DNA-Molekiil und einem dazu 
komplementaren Strang. 

GemafJ einer weiteren Ausf uhrungsf orm betrifft die Erfindung ein 
ss-DNA-Molekul, ausgewahlt aus der folgenden Gruppe: 
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(i) ss-DNA-Molekul mit einer Sequenz der Positioner! 35747 
bis 3 67 69 (Domane C des tubB-Gens) der Sequenz gemafi 
Figur 1; 

(ii) ss-DNA-Molekul mit einer Sequenz der Positionen 37184 
bis 39817 (Domane A des tubB-Gens) der Sequenz gemafi 
Figur 1; 

(iii) ss-DNA-Molekul mit einer Sequenz der Positionen 38369 
bis 397 30 (Domane NMT des tubB-Gens) der Sequenz gemafi 
Figur 1; 

(iv) ss-DNA-Molekul mit einer Sequenz der Positionen 39818 
bis 40069 (Domane PCP des tubB-Gens) der Sequenz gemafi 
Figur 1; 

(v) ss-DNA-Molekul mit einer Sequenz der Positionen 40372 
bis 41397 (Domane C des tubC-Gens) der Sequenz gemafi 
Figur 1; 

(vi) ss-DNA-Molekul mit einer Sequenz der Positionen 41824 
bis 43215 (Domane A des tubC-Gens) der Sequenz gemafi 
Figur 1; 

(vii) ss-DNA-Molekul mit einer Sequenz der Positionen 43216 
bis 434 61 (Domane PCP des tubC-Gens) der Sequenz gemafi 
Figur 1; 



(viii) 



ss-DNA-Molekul mit einer Sequenz der Positionen 43552 
bis 4 4 574 (Domane C des tubC-Gens) der Sequenz gemafi 
Figur 1; 
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(x) 



(xi) 



(ix) ss-DNA-Molekul mit einer Sequenz der Positionen 44980 

bis 4 7 631 (Domane A des tubC-Gens) der Sequenz gemali 
Figur 1; 

ss-DNA-Molekul mit einer Sequenz der Positionen 46153 
bis 47547 (Domane NMT des tubC-Gens) der Sequenz gemali 
Figur 1; 

ss-DNA-Molekul mit einer Sequenz der Positionen 47632 
bis 4 78 68 (Domane PCP des tubC-Gens) der Sequenz gemali 
Figur 1; 

(xii) ss-DNA-Molekul mit einer Sequenz der Positionen 48011 
bis 49321 (Domane KS des tubD-Gens) der Sequenz gemali 
Figur 1; 

(xiii) ss-DNA-Molekul mit einer Sequenz der Positionen 49622 
bis 50584 (Domane AT des tubD-Gens) der Sequenz gemali 
Figur 1; 

(xiv) ss-DNA-Molekul mit einer Sequenz der Positionen 51473 
bis 52309 (Domane KR des tubD-Gens) der Sequenz gemali 
Figur 1; 

(xv, ss-DNA-Molekul mit einer Sequenz der Positionen 53066 

bis 53980 (Domane ER des tubD-Gens) der Sequenz gemali 
Figur 1; 

(xvi) ss-DNA-Molekul mit einer Sequenz der Positionen 54158 

bis 54460 (Domane ACP des tubD-Gens) der Sequenz gemali 
Figur 1; 
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(xvii) ss-DNA-Molekiil mit einer Sequenz der Positionen 54461 
bis 55870 (Domane HC des tubD-Gens) der Sequenz gemaJS 
Figur 1; 

(xviii) ss-DNA-Molekiil mit einer Sequenz der Positionen 56000 
bis 57 412 (Domane A des tubD-Gens) der Sequenz gemaJi 
Figur 1; 

(xix) ss-DNA-Molekul mit einer Sequenz der Positionen 57413 
bis 57643 (Domane PCP des tubD-Gens) der Sequenz gema/J 
Figur 1; 

(xx) ss-DNA-Molekiil mit einer Sequenz der Positionen 58689 
bis 59714 (Domane C des tubE-Gens) der Sequenz gemali 
Figur 1; 

(xxi) ss-DNA-Molekiil mit einer Sequenz der Positionen 60156 
bis 61697 (Domane A des tubE-Gens) der Sequenz gemaJi 
Figur 1/ 

(xxii) ss-DNA-Molekul mit einer Sequenz der Positionen 61698 
bis 61967 (Domane PCP des tubE-Gens) der Sequenz gemafj 
Figur 1/ 

(xxiii) ss-DNA-Molekul mit einer Sequenz der Positionen 62127 
bis 63320 (Domane KS des tubF-Gens) der Sequenz gemafj 
Figur 1; 



(xxiv) 



ss-DNA-Molekiil mit einer Sequenz der Positionen 63711 
bis 64 67 6 (Domane AT des tubF-Gens) der Sequenz gemaB 
Figur 1; 
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(xxv) ss-DNA-Molekiil mit einer Sequenz der Positionen 64 959 

bis 65882 (Domane KR des tubF-Gens) der Sequenz gemafi 
Figur 1; 



(xxvi) 



ss-DNA-Molekiil mit einer Sequenz der Positionen 65985 
bis 67061 (Domane CMT des tubF-Gens) der Sequenz gemafi 
Figur 1; 



(xxvii) ss-DNA-Molekiil mit einer Sequenz der Positionen 67242 
bis 67829 (Domane DH des tubF-Gens) der Sequenz gemafi 
Figur 1; 

(xxviii) ss-DNA-Molekiil mit einer Sequenz der Positionen 68247 
bis 69128 (Domane ER des tubF-Gens) der Sequenz gemafi 
Figur 1; 



(xxix) 



ss-DNA-Molekiil mit einer Sequenz der Positionen 69360 
bis 69605 (Domane PCP des tubF-Gens) der Sequenz gemafi 
Figur 1; 



(xxx) 



ss-DNA-Molekiil mit einer Sequenz der Positionen 69759 
bis 7058 6 (Domane TE des tubF-Gens) der Sequenz gemafi 
Figur 1; 



(xxxi ) 



ss-DNA-Molekiil, das mit einem Molekul gemafi (i) , (ii) , 
(iii), (iv), (v), (vi), (vii), (viii), (±x), (x), 
(xi), (xii), (xiii), (xiv) , (xv), (xvi) , (xvii) , 
(xviii), (xiv), (xx), (xxi), (xxii), (xxiii), (xxiv), 
(xxv), (xxvi), (xxvii), (xxviii), (xxix) oder (xxx) 
unter stringenten Bedingungen hybridisierbar ist und 
insbesondere dieselbe Anzahl von Basen aufweist; 
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(xxxii) 



ss-DNA-Molekul, das mit einem Molekul gemafi (i), (ii), 
(iii), (iv) , (v), (vi), (vii), (viii), (ix) , (x) , 
(xi), (xii), (xiii), (xiv), (xv), (xvi) , (xvii) , 
(xviii), (xiv), (xx), (xxi), (xxii), (xxiii), (xxiv), 
(xxv), (xxvi), (xxvii), (xxviii), (xxix) oder (xxx) 
hinsichtlich seiner Nucleotid-Anzahl oder seiner 
Nucleotid-Sequenz zur jeweils 90, 91, 92, 93, 94, 95, 
96, 97, 98, 99 oder 100% homolog ist, jedoch von 
diesem ss-DNA-Molekiil hinsichtlich seiner Nucleotid- 
Anzahl und/oder seiner Nucleotid-Sequenz in mindestens 
einem Nucleotid abweicht; und 



(xxxiii) ss-DNA-Molekiil mit einer Sequenz, die zur Sequenz 
eines Molekuls gemaft (i), (ii) , (iii), (i v ) , (v) , 
(vi), (vii), (viii), (ix), ( X ), (xi), (xii), (xiii), 
(xiv), (xv), (xvi), (xvii), (xviii), (xiv), (xx), 
(xxi), (xxii), (xxiii), (xxiv), (xxv), (xxvi), 
(xxvii), (xxviii), (xxix), (xxx), (xxxi) oder (xxxii) 
komplementar ist. 



Ferner betrifft die Erfindung ein ds-DNA Molekul aus einem 
derartigen ss-DNA-Molekiil und einem dazu komplementarem Strang. 

Ferner betrifft die Erfindung Varianten oder Mutanten, die aus 
einer Substitution, Insertion oder Deletion von Nucleotiden oder 
einer Inversion von Nucleotid-Segmenten eines erf indungsgemaften 
ss-DNA-Molekiils oder eines erf indungsgemalien ds-DNA-Molekiils 
resultieren, wobei diese Varianten und Mutanten Enzym-Varianten 
oder Enzym-Mutanten fur die Produktion von Sekundarstof f (en) mit 
den eingangs geschilderten und far Tubulysine charakteristischen 
Eigenschaften codieren, insbesondere mit der cytostatischen 
Wirkung. Der Fachmann ist mit Massenscreening vertraut. 
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Ferner betrifft die Erfindung RNA 

(a) mit einer Sequenz entsprechend der eines erf indungsgemalien 
ss-DNA-Molekiils oder 

(b) mit einer Sequenz einer RNA gemaii (a) , aber in 
Gegenrichtung (anti-sense), oder 

(c) mit einer Sequenz einer RNA gemaJi (a) oder (b) und mit 
einem dazu komplementaren Strang, 

jeweils gegebenenfalls als Element eines rekombinanten Vektors. 

Gemafi einer weiteren Ausf uhrungsf orm betrifft die Erfindung 
einen rekombinanten Vektor, insbesondere Expressionsvektor mit 
einem erf indungsgema/len DNA-Molekul. 



eine 



Gemaft einer weiteren Ausf uhrungsf orm betrifft die Erfindung 
Zelle, insbesondere zur Expression, in die ein erf indungsgemaJJes 
DNA-Molekiil oder ein erf indungsgemafler Vektor integriert ist. 

Die erfindungsgemalie Zelle kann sich von kultivierbaren 
Bakterien wie Myxobakterien wie Angiococcus, insbesondere A. 
disciformis, Archangium, insbesondere A. gephyra, Escherichia 
coli, Pseudomonaden oder Actimomyceten herleiten. 

Gemafi einer weiteren Ausf uhrungsf orm betrifft die Erfindung eine 
Verwendung eines erf indungsgemaflen Vektors fur die 
Transformation von Zellen oder Organismen zur transienten oder 
permanenten Expression eines oder mehrerer Proteine 
(Expressionsprodukt(e)), das (die) durch eine DNA (ssDNA oder 
dsDNA) des Vektors codiert wird (werden) . 



GemaB eine weiteren Ausf uhrungsf orm betrifft die Erfindung ei 
Verwendung einer erf indungsgemalien Zelle zur enzymatischen 



ne 
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Biosynthese, Mutasynthese Oder Partial-Synthese eines 
Tubulysins, insbesondere Tubulysin A, B, C, D, E und/oder F. 



Gemafl einer weiteren Ausf uhrungsf orm betrifft die Erfindung 
Expressionsprodukt eines erf indungsgemafien DNA-Molekiils oder 
eines erf indungsgemaiJen Vektors oder einer erf indungsgemafien 
Zelle. 



em 



Die vorliegende Erfindung betrifft insbesondere ein 
Polynucleotid, das eine Sequenz wie in SEQ ID NO: 1, 18, 33 oder 
36 definiert, oder ein Fragment davon enthalt. SEQ ID NO: 1 und 
18 beschreiben den ( + ) - bzw. den (-) -Strang des Tubulysin 
Biosynthese-Clusters von Angiococcus discif ormis . SEQ ID NO: 33 
ist eine Sequenz, die mehrere sich iiberlappende Gene des 
Clusters enthalt. SEQ ID NO: 36 beschreibt eine Mutante von 
Angiococcus discif ormis . Uberraschenderweise wurde gefunden, 
dass diese Mutante ein Vielfaches der Tubulysin D-Produktion des 
Wildtyps zeigte. Die Tubulysin-Oberexpression ist auf die 
Gesamtwirkung aller Tubulysin-Derivate bezogen sogar noch hoher 
als die von Tubulysin D, was in keinsterweise zu erwarten war. 
Die Gene der SEQ ID NO: 36 sind offenbar an der negativen 
Regulation der Tubulysin Expression beteiligt. Diese Mutante ist 
auf Grund der erhShten Expression aller Tubulysine besonders 
geeignet zur Produktion der erf indungsgemafien Polypeptide. 
Antikdrper gegen die Wildtyp-Expressionsprodukte dieser Sequenz 
kdnnen eingesetzt werden, um deren negativen Einfluss auf die 
Tubulysin Produktion auch in anderen Stammen zu minimieren. 
Einen ahnlichen Effekt hben auch antisense-RNA bzw RNAi- 
Techniken, die mit der Wildtyp Sequenz der negativen 
Regulatorgene interagieren. 
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Die Fragmente des Polynucleotid konnen jede beliebige 
Teilsequenz und Lange haben, bevorzugt sind jedoch solche 
Fragmente, die Proteine codieren. Die Polynucleotide der 
vorliegenden Erfindung schlieiien ebenfalls ein, sind jedoch 
nicht beschrankt auf, ein Polynucleotid, das an den 
komplementaren Strang der offenbarten Nucleotidsequenzen unter 
moderat stringenten oder stringenten Bedingungen hybridisiert ; 
ein Polynucleotid, das eine Allel-Variante irgendeines oben 
beschriebenen Polynucleotids ist; ein Polynucleotid, das ein 
Spezies-Homolog irgendwelcher der hier offenbarten Proteine 
codiert; oder ein Polynucleotid, das ein Polypeptid codiert, das 
eine zusatzliche spezifische Domane oder eine Trunkierung bzw. 
Verkurzung der offenbarten Proteine aufweist. 

Der Begriff "CDS" bzeichnet eine Sequenz von Nucleotiden, die 
der Sequenz von Aminosauren in einem Protein entspricht, also 
die Aminosauren-codierenden Sequenzbereiche, inklusive des 
jeweiligen Start- und Stoppcodons . 

In einer bevorzugten Ausf uhrungsf orm ist das erf indungsgemafle 
Polynucleotid ein Fragment, das eine im Sequenzprotokoll 
definierte CDS ist. 

Die vorliegende Erfindung betrifft weiterhin einen Vektor, der 
ein Polynucleotid wie oben beschrieben enthalt. Vektoren fur 
verschiedene Zwecke sind im Stand der Technik bekannt, ebenso 
die Techniken zur Subklonierung von Polynucleotiden in solche 
Vektoren. Diese werden beispielsweise in der neuen Ausgabe von 
Molecular Cloning: A Laboratory Manual, (Sambrook et al . , (1989) 
Molecular Cloning: A Laboratory Manual, 2nd ed. , Cold Spring 
Harbor Laboratory Press); DNA Cloning, Volumes I and II (D. N. 
Glover ed., 1985); Gene Transfer Vectors for Mammalian Cells 
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(Miller & Calos, eds.); Current Protocols in Molecular Biology 
and Short Protocols in Molecular Biology, 3rd Edition (F. M. 
Ausubel et al . , eds.); Recombinant DNA Methodology (R. Wu ed., 
Academic Press) oder "A Practical Guide To Molecular Cloning" 
beschrieben. Beispiele far Vektoren finden sich unter anderen in 
Gene Transfer Vectors For Mammalian Cells (J. H . Miller and M. 
P- Calos eds., 1987, Cold Spring Harbor Laboratory). 

Bei dem Vektor handelt es sich vorzugsweise urn einen 
Expressionsvektor, also im Allgemeinen ein Plasmid, einen 
Phagen, ein Virus oder einen Vektor zum Exprimieren eines 
Polypeptids aus einer DNA (RNA) Sequenz. Ein Expressionsvektor 
kann eine Transkriptionseinheit umfassen, die eine Anordnung des 
Folgenden aufweist: (1) ein genetisches Element oder Elemente 
mit einer regulatorischen Rolle in der Genexpression, 
beispielsweise Promotoren oder Enhancer, (2) eine 
Struktursequenz oder codierende Sequenz, die in mRNA 
transkribiert und in ein Protein translatiert wird und (3) 
geeignete Transkriptionsstart- und -terminationssequenzen. 
Struktureinheiten, die zur Verwendung in Hefen oder 
eukaryontischen Expressionssystemen vorgesehen sind, schlieiben 
vorzugsweise eine Leadersequenz ein, die die extrazellulare 
Sekretion eines translatierten Proteins durch einen Wirt 
ermoglicht. Es kann alternativ, wenn ein rekombinantes Protein 
ohne eine Leader- oder Transportsequenz exprimiert wird, einen 
N-terminalen Methionin-Rest einschliefien . Dieser Rest kann, muss 
aber nicht anschliellend von dem exprimierten rekombinanten 
Protein abgespalten werden, urn das Endprodukt zu erhalten. 

Des Weiteren betrifft die vorliegende Erfindung eine Zelle, die 
einen solchen Vektor enthalt. Der Vektor kann aber die bekannten 
Techniken wie beispielsweise Transf ektion, Elektroporation, 
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Lipofektion usw. in die Zelle eingebracht werden. Bei viralen 
Vektoren ist auch eine Infektion mSglich. B ei den Zellen kann e 
sich um eukaryotische als auch prokaryoti Sc he Zellen handeln. 
Auch die Methoden zur Selektion und Propagierung der Zellen, di 
den Vektor enthalten sind dem Fachmann bekannt. Beispiele fur 
die Kultivierung von Zellen tierischen Ursprungs sind u.a. in 
Culture Of Animal Cells (R. I. Freshney, Alan R. Liss, Inc., 
1987) zu finden. 

Eine weitere Ausf uhrungsf orm betrifft ein Polypeptid, das 
wenigstens eine Sequenz wie in SEQ ID NO: 2 bis 17, 19 bis 32, 
34, 35, 37 und/oder 38 definiert und/oder ein Fragment und/oder 
Derivat davon enthalt. Das Polypeptid kann sowohl durch 
Expression eines Polynucleotids als auch durch chemische 
Synthese bereitgestellt werden. 



sen 



Die Aminosauresequenzen der vorliegenden Erfindung umfas 
ebenfalls alle Sequenzen, die sich von den hier offenbarten 
Sequenzen durch Aminosaureinsertionen, -deletionen und - 
substitutionen unterscheiden . 

Aminosaure- „Substitutionen« sind vorzugsweise das Ergebnis des 
Ersetzens einer Aminosaure durch eine andere Aminosaure mit 
ahnlichen strukturellen und/oder chemischen Eigenschaf ten, das 
heilit konservative Aminosaure-Austausche . Aminosaure- 
Substitutionen konnen auf der Grundlage einer Ahnlichkeit in der 
rolaritat, Ladung, L6slichkeit, Hydrophobie, Hydrophilie 
und/oder der amphipatischen Natur der einbezogenen Reste 
vorgenommen werden. Beispielsweise schliefien unpolare 
(hydrophobe) Aminosauren Alanin, Leucin, Isoleucin, Valin, 
-rclin, Phenylalanin, Tryptophan und Methionin ein; polare 
neutrale Aminosauren schlieften Glycin, Serin, Threonin, Cystein, 



WO 2004/022586 



PCT/EP2003/009780 



17 



Tyrosin, Asparagin und Glutamin ein; positiv geladene (basische) 
Aminosauren schlieBen Arginin, Lysin und Histidin ein; und 
negativ geladene (saure) Aminosauren schlieAen Asparaginsaure 
und Glutaminsaure ein, 

"Insertionen" Oder "Deletionen" bewegen sich typischerweise im 
Bereich von 1-3 Aminosauren. Die erlaubte Variation kann 
experimentell bestimmt werden, indem systematisch Insertionen, 
Deletionen oder Substitutionen von Aminosauren in einem 
Polypeptidmolekul unter Verwendung von DNA- 

Rekombinationstechniken vorgenommen und die sich ergebenden 
rekombinanten Varianten beztiglich ihrer Aktivitat untersucht 
werden. Dazu ist far den Fachmann nicht mehr als die 
Durchfuhrung von Routineexperimenten erf orderlich . 

Beispielsweise kann das Polypeptid auch in Form eines chimaren 
Polypeptids, codiert durch ein Fusionsgens, vorliegen, dass 
wenigstens eine weitere Sequenz enthalt. Diese zusatzliche 
Sequenz kann z.B. dazu dienen, die Aufreinigung des 
Expressionsprodukts zu erleichtern oder dem Expressionsprodukt 
eine zusatzlich Funktion zu verleihen. 

Beispiele fur zusatzliche Sequenzen, die die Aufreinigung 
erleichtern sind sogenannte Tags, die dem Fachmann bekannt sind, 
wie z.B. das his-tag. 

Die vorliegende Erfindung betrifft auiierdem die Verwendung 
wenigstens eines Polynucleotids wie in SEQ ID NO: 1, 18, 33 
und/oder 36 definiert und/oder wenigstens eines Fragments davon 
und/oder wenigstens eines Polypeptids wie in SEQ id NO: 2 bis 
17, 19 bis 32, 34, 35, 37 und/oder 38 definiert und/oder 
wenigstens eines Fragments davon zur Herstellung einer 
Pharmazeutischen Zusammenset zung zur Behandlung von 
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unerwiinschtem Zellwachstums oder unerwtinschter Zellvermehrung in 
einem Individuum. Die Zusammenset zung kann beispielsweise einen 
geeigneten Vektor enthalten, zusammen mit Hilf sf aktoren, die die 
Expression eines Tubulysins, vorzugsweise in den unerwunschten 
Zellen, ermoglichen und dadurch ein weiteres Wachstum bzw. eine 
weiter Vermehrung dieser Zellen verhindern. Die Zusammenset zung 
kann auch erf indungsgemafte Zellen enthalten, die mit einem 
Vektor, beispielweise einen Tubulysin-exprimierenden Vektor, 
transfiziert wurden. 

In einer bevorzugten Ausf uhrungsf orm ist das unerwunschte 
Zellwachstum oder die unerwunschte Zellvermehrung ein Tumor. Der 
Tumor kann sowohl eine gutartige als auch eine bosartige 
Geschwulst sein. 

In einer weiteren Ausf uhrungsf orm ist das unerwunschte 
Zellwachstum eine pathogene Infektion. Das Pathogen kann hierbei 
sowohl einzellig als auch mehrzellig sein. Dies schlielit auch 
Infektionen mit Pilzen, wie beispielsweise Candida oder 
Aspergillus, und Infektionen mit Parasiten, wie beispielweise 
Trypanosomen oder Schistosomen ein. In einer bevorzugten 
AusfUhrungsform der Verwendung ist die pathogene Infektion eine 
Mykose, Malaria oder eine parasitare Erkankung. 

Die Erfindung betrifft ferner eine pharmazeutische 
Zusammensetzung, enthaltend wenigstens ein Polynucleotid wie in 
SEQ ID NO: 1, 18, 33 und/oder 36 definiert und/oder wenigstens 
ein Fragment davon und/oder wenigstens ein Polypeptid wie in SEQ 
ID NO: 2 bis 17, 19 bis 32, 34, 35, 37 und/oder 38 definiert 
und/oder wenigstens ein Fragment davon. Die Zusammenset zungen 
enthalten eine therapeutisch wirksame Menge bzw. Dosis des 
jeweiligen Wirk- bzw. Inhaltsstof f es . Eine therapeutisch 
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wirksame Dosis betrifft diejenige Menge der Verbindung, die 
ausreicht, urn eine Linderung der Symptome, beispielsweise eine 
Behandlung, Heilung, Pravention oder Linderung derartiger 
Zustande, insbesondere die Hemmung oder Verhinderung von 
unerwunschtem Zellwachstum und Zellvermehrung in einem Patienten 
zu ergeben. Geeignete Verabreichungswege schlieften 
beispielsweise eine parenteral Verabreichung, einschliefilich 
intramuskularer und subkutaner Injektionen, ebenso wie 
intrathekaler, direkt intraventrikularer, intravenoser, 
intraperitonealer Injektionen ein. 

in einer weiteren Ausf uhrungsf orm enthalt diese pharmazeutische 
Zusammensetzung wenigstens einen pharmazeutisch vertraglichen 
Tragerstoff. Eine solche Zusammensetzung kann weiterhin 
(zusatzlich zum Inhaltsstoff und dem Trager) Verdunnungsmittel , 
Fullmittel, Salze, Puffer, Stabilisatoren, Losungsvermittler und 
andere Materialien enthalten, die im Stand der Technik wohl 
bekannt sind. Der Begriff "pharmazeutisch vertraglich" bedeutet 
ein nicht-toxisches Material, das die Wirksamkeit der 
biologischen Aktivitat des aktiven Inhaltsstoff (der aktiven 
Inhaltsstoff e) nicht stort. Die Eigenschaf ten des Tragers hangen 
vom Verabreichungsweg ab. Die therapeutische Zusammensetzung 
kann weiterhin weitere Mittel bzw. Wirkstoffe enthalten, die die 
Aktivitat bzw. Wirksamkeit oder Anwendung bei Behandlung 
verbessern bzw. erleichtern. Solche zusatzlichen Faktoren 
und/oder Mittel konnen in der therapeutischen Zusammensetzung 
enthalten sein, urn eine synergistische Wirkung zu erzeugen oder 
urn Nebenwirkungen zu minimieren. 

Techniken zur Formulierung, Zubereitung und Verabreichung der 
Verbindungen der vorliegenden Erfindung sind in "Remington's 
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Pharmaceutical Sciences", Mack Publishing Co., Easton, PA, 
neueste Ausgabe, zu finden. 

Die vorliegende Erfindung betrifft aufterdem ein Verfahren zur 
Herstellung von Tubulysinen und Tubulysin-Biosynthese Proteinen, 
umfassend die Schritte: 

(a) exprimieren von wenigstens einem Polynucleotid wie in 
SEQ ID NO: 1, 18, 33 und/oder 36 definiert und/oder 
wenigstens einem Fragment davon und/oder wenigstens 
einem Polypeptid wie in SEQ ID NO: 2 bis 17, 19 bis 32, 
34, 35, 37 und/oder 38 definiert und/oder wenigstens 
einem Fragment davon, und 

(b) aufreinigen der Expressionsprodukte . 

Methoden zur Expression von Proteinen sind dem Fachmann bekannt 
und konnen der einschlagigen Literatur entnommen werden, 
beispielweise aus Methods In Enzymology, Vols. 154 and 155 (Wu 
et al. eds.) oder Recombinant DNA Methodology (R. Wu" ed. , 
Academic Press). Fur das Aufreinigen von Expressionsprodukten 
sind dem Fachmann eine Vielzahl von Methoden bekannt. Neben 
chromatographischen Methoden wie beispielsweise 

Affinitatschromatographie oder HPLC konnen auch immunologische 
Verfahren wie beispielsweise immobilisierte Antikorper gegen ein 
Epitop auf dem Expressionsprodukt , z.B. einem His-tag, zum 
Aufreinigen der Produkte verwendet werden. 

In einer bevorzugten Ausf uhrungsf orm erfolgt die Expression in 
prokaryotischen oder eukaryotischen Zellen und/oder durch in 
vitro Expression. Die Expression von Polypeptiden in 
prokaryotischen oder eukaryotischen Zellen ist ein haufig 
angewandtes Verfahren und wird im Allgemeinen mittels eines 
Expressionsvektors, wie oben beschrieben, erreicht. Fur die in 
vitro Expression sind ebenf alls bereits Vektoren beschrieben. 



WO 2004/022586 



PCT7EP2003/009780 



21 



Diese und die notwendigen Faktoren sind kommerziell in Form von 
Kits erhaltlich, beispielsweise von BioRad, Stratagene, 
Invitrogen und Clontech. 

Desweiteren stellt die Erfindung ein Verfahren zum Auffinden von 
Genen, die an der Biosynthese von Tubulysinen beteiligt sind, 
bereit. Das Verfahren umfasst die folgenden Schritte: 

(a) hybridisieren von wenigstens einem Polynucleotid wie in 
SEQ ID NO: 1, 18, 33 und/oder 36 definiert und/oder 
wenigstens einem Fragment davon mit DNA, RNA, und/oder 
cDNA einer Spezies, die nicht mit Angiococcus 
disciformls identisch ist, und 

(b) isolieren und charakterisieren der hybridisierten DNA, 
RNA, und/oder cDNA. 

Die Hybridisierung kann unter unterschiedlich stringenten 
Bedingungen durchgefUhrt werden. 

Die Stringenz der Hybridisierung, wie hierin verwendet, betrifft 
Bedingungen, unter denen Polynucleotid-Doppelstrange stabil 
sind. Wie dem Fachmann bekannt ist, ist die Stability eines 
Doppelstranges eine Funktion der Natriumionenkonzentration und 
der Temperatur (siehe beispielsweise Sambrook et al., Molecular 
Cloning: A Laboratory Manual 2 nd Ed. (Cold Spring Harbor 
Laboratory, (1989)). Die Stringenzniveaus, die zur 
Hybridisierung verwendet werden, konnen vom Fachmann leicht 
abgewandelt werden. 

Der Begriff „schwach stringente Hybridisierung" bezeichnet 
Bedingungen, die einer Hybridisierung in 10% Formamid, 5 x 
Denharts Losung, 6 x SSPE, 0,2 % SDS bei 42°C, gefolgt von 
Waschen in 1 x SSPE, 0,2 % SDS bei 50°C Equivalent sind 
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Denhart^s Losung und SSPE sind dem Fachmann genauso wie andere 
geeignete Hybridisierungspuf f er wohl bekannt. 

Eine „moderat stringente Hybridisierung" bedeutet Bedingungen, 
die es der DNA erlauben, an eine koiuplementare Nucleinsaure zu 
binden, die ungefahr 60 % Identitat, vorzugsweise ungefahr 75 % 
Identitat, besonders bevorzugt ungefahr 85 % Identitat zu dieser 
DNA aufweist; wobei eine Identitat von mehr als ungefahr 90 % zu 
dieser DNA besonders bevorzugt wird. Moderat stringente 
Bedingungen sind vorzugsweise Bedingungen, die eine 
Hybridisierung in 50 % Formamid, 5 x Denharts Losung, 5 x SSPE, 
0,2 % SDS bei 42°C gefolgt von Waschen in 0,2 x SSPE, 0,2 % SDS 
bei 65°C Equivalent sind. 

Hochstringente Hybridisierung bedeutet Bedingungen, die die 
Hybridisierung nur von solchen Nucleinsauresequenzen 
ermoglichen, die in 0,018 M NaCl bei 65°C stabile Doppelstrange 
bilden (d.h., wenn ein Doppelstrang in 0,018 M NaCl bei 65°C 
nicht stabil ist, ist er unter den hier 

beschriebenen/def inierten hochstringenten Bedingungen nicht 
stabil) . 

Weiterhin konnen Nucleinsaure-Hybridisierungstechniken verwendet 
werden, um eine Nucleinsaure zu identif izieren und zu gewinnen, 
die von der vorliegenden Erfindung urufasst ist. Kurz gesagt kann 
jede Nucleinsaure mit einer gewissen Homologie zu einer in 
dieser Erfindung offenbarten Sequenz oder einem Fragment davon, 
als Sonde zur Identif izierung einer ahnlichen Nucleinsaure durch 
Hybridisierung unter moderat stringenten bis hochstringenten 
^eamgungen verwendet werden. Solche ahnlichen Nucleinsauren 
::.".r.T.p.r. dann isoliert, sequenziert und analysiert werden, um zu 
B&stinmen, ob sie von der vorliegenden Erfindung umfasst werden. 
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Die vorliegende Erfindung stellt aulierdem einen Kit zur 
Herstellung von Tubulysinen, enthaltend: 

(a) wenigstens ein Polynucleotid, enthaltend eine Sequenz 
wie in SEQ ID NO: 1, 18, 33 oder 36 definiert Oder ein 
Fragment davon und/oder wenigstens einen Vektor der ein 
solches Polynucleotid enthalt 

oder 

(b) geeignete Medien und Puffer zur Vermehrung von Zellen, 
die eine Expression des Polynucleotids und/oder Vektors 
erlauben, und 

(c) geeignete Mittel zum Aufreinigen des/der 
Expressionsprodukte (s) 

bereit . 

Auf Grund ihrer Wirkung auf das Tubulin-Skelett und ihrer 
Cytotoxizitat, insbesondere bei Pilzen, eignen. sich Tubuline 
auch als Desinfektionsmittel, das eine Kontamination mit 
Tubulin-enthaltenden Zellen abbauen bzw. verhindern kann. 
Die Erfindung betrifft deshalb auch die Verwendung einer 
Zusammensetzung enthaltend wenigstens ein Polypeptid wie in SEQ 
ID NO: 2 bis 17, 19 bis 32, 34, 35, 37 und/oder 38 definiert 
und/oder wenigstens ein biologisch aktives Fragment oder Derivat 
davon als Desinfektionsmittel. Neben dem oben definierten 
Polypeptid konnen im Desinfektionsmittel auch andere 
desinfizierend wirkende Stoffe enthalten sein, solange sie die 
Wirkung des erf indungsgemaiien Polypeptids nicht inhibieren. 
Auflerdem kann das Desinfektionsmittel weitere Hilfsstoffe wie 
beispielsweise Puffer, Wasser, Farbstoffe, Duftstoffe, 
Stabilisatoren, Tragerstoffe usw. enthalten. 

In einer bevorzugten Ausf tihrungsf orm ist die Zusammensetzung 
fltissig oder pulverfarmig . 
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Entsprechend betrifft die Erfindung auch Desinf ektionsmittel wie 
oben definiert. 

Soweit keine anderen Definitionen angegeben sind, weisen alle 
technischen und wissenschaf tlichen Begriffe, die hierin 
verwendet werden, dieselbe Bedeutung auf, wie sie tiblicherweise 
vom Fachmann auf dem Gebiet, an den sich diese Erfindung wendet, 
verstanden werden. Alle Verof f entlichungen, Patentanmeldungen, 
Patente und anderen hierin erwahnten Referenzen sind durch 
Bezugnahme in ihrer Gesamtheit mit aufgenommen. Im Falle eines 
Konfliktes wird jedoch die vorliegende Beschreibung 
einschliefilich der Definitionen entscheiden. Zusatzlich sind die 
Materialien, Methoden und Beispiele lediglich veranschaulichend 
und sollen nicht als einschrankend aufgefasst werden. 
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1. Identif izierung des Tubulysin-Biosyntheseclusters in 
Anglococcus dlsclformis An d48 durch mariner Transposon- 
Mutagenese mittels pMycoMar 

Die Identifizierung des Tubulysin-Biosyntheseclusters wurde 
durch die Erstellung einer Transposon-Mutantenbank aus 
Angiococcus disciformis An d48 mittels pMycoMar durchgef uhrt . 
Rubin & Mekalanos (Proc. Natl. Acad. Sci. USA, 96 (1999), 1645 - 
1650) entwickelten aus dem mariner Element Himarl das Plasmid 
pMycoMar, welches ein einfaches Transpositonssystem darstellt, 
das effizient Bakterien in vivo infizieren und Insertions- 
Mutanten generieren kann. Dieses Plasmid enthalt das Mini- 
Transposon magellanA, bei dem das Tn5 Kanamycin-Resistenzgen und 
der oriR6K von den inverted repeats des Himarl flankiert sind. 
Zusatzlich ist die Himarl Transposase unter der 
transkriptionalen Kontrolle des T6 Promotors in das 
mycobakterielle Temperatur-sensitive Replicon pPR23 kloniert 
worden. Das pMycoMar codiert ebenfalls ein Gentamycin- 
Resistenzgen . 

Das Himarl zeichnet sich bei der Transposition durch eine TA 
Dinucleotid-Erkennungssequenz aus. Daher kann es zufallig in ein 
Wirts-Genom integrieren und statistisch gesehen alle aktiven 
Gene durch eine Insertions-Mutation ausschalten. Aufgrund dieser 
Tatsache sollte eine Mutantenbank aus An d48 generiert und das 
Tubulysin-Biosynthesecluster durch eine Knockout-Mutant e 
identif iziert werden . 

Alternativ kann man auch von Archangium gephyra DSM 11092 
ausgehen und nach einem Protokoll von Biozym Diagnostic 
(Oldendorf, DE; Katalog TSM99K2; pEZ: :TN<KAN-2> Tnp Transposome- 
Kit) arbeiten. 
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1 . 1 Generierung der Mutantenbank 

Fur eine Elektrotransf ormation von A. disciformis An d48 warden 
zwei verschiedene Protokolle verwendet. Diese Protokolle wurden 
fur die Myxobakterien Stigmatella aurantiaca (Stamm & Plaga, Arch. 
Mircobiol., 172 (1995), 483 - 494) und Myxococcus xanthus (Kashefi 
& Hartzell, Mol . Microbiology, 15 (1995) 483 - 494) etabliert. Die 
beiden Methoden zeigten in der Transf ormationsef f izienz des 
A. disciformis An d48 keinen Unterschied, so class die 
Elektrotransformation zur Erstellung der Transposonbank nach dem 
Protokoll fur Stigmatella aurantiaca durchgefuhrt wurde. Im 
folgenden sind die beiden Protokolle aufgefuhrt. 

l.l.i. Elektrotransformation von Axigiococcus disciformis An d48 
nach Stigmatella. aurantiaca-Protokoll 

Eine in 50 ml Tryptonmedium (10 g Trypton; 2 g MgS04 2 . 0, 1 % 
Vitamin B12 [10 ng/ml]; 0,2 % Glucose auf 1 1 Medium;' P H7, 2) 
gewachsene A. disciformis Kultur wird bis 2 * 10 8 Zellen / ml bei 
30 °C kultiviert. Ausgehend von einer Generationszeit von 6 
Stunden, wurde diese Kultur am Vortag so angeimpft, dass 
rechnerisch diese Zelldichte erreicht wird. Die Kultur wird bei 
20°C abzentrifugiert (20min; 4000rpm) und die Zellen im gleichen 
Volumen Waschpuffer (5 mM HEPES/NaOH, 0,5 mM CaCl 2; P H "7,2) 
resuspendiert. Nach erneuter Zentrif ugation wird in 25 ml Puffer 
resuspendiert und erneut abzentrifugiert. Vor diesem 
Zentrifugationsschritt wird die absolute Zellzahl innerhalb der 25 
ml bestimmt, so dass rechnerisch 1 * i 0 9 Zellen/ 40 pi 
resuspendiert werden. 
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Elektropor ationsbedinaunaen : 

1-3 pg DNA und 4 0 ul Zellsuspension werden gemischt und in eine 
auf Eis gekuhlte Elektroporationskuvette (0,1 cm) gegeben. Die 
Elektroporation wird bei 200 Q, 25 mF und 0,85 kV / cm 
durchgefuhrt. 

Direkt nach der Elektroporation wird 1 ml Tryptonmedium zugegeben. 
Nach Transfer in 50 ml Tryptonmedium werden die Zellen fur eine 
phanotypische Expression 6 h bei .30°C geschiittelt. Danach wird die 
Kultur abzentrifugiert (20 min, 4000 rpm, 20°C) und in 1 ml 
Trypton resuspendiert . Ausgehend von einer 100 %-igen 
Oberlebensrate der Zellen wird eine Verdunnungsreihe erstellt und 
1*10 8 - 1*10 4 Zellen werden mit 3 ml Trypton-Sof tagar auf 
Kanamycin haltigen (50 ug / ml) Tryptonplatten plattiert. Diese 
Flatten werden bei 30 °C inkubiert und nach 5 - 8 Tagen sind die 
ersten Klone zu sehen. 

1.1.2. Elektrotransformation von Anglococcus dlscifoxmls An d48 
nach Myxococcus xanthus-Protokoll 

Die Wachstumsbedingungen der Vor- und Hauptkultur sowie 
Zentrifugationen und das abschliefiende Einengen der Zellzahl 
wurden genauso durchgefuhrt wie unter l.l.i. angegeben ist. Dies 
ist abweichend vom Standardprotokoll fur Myxococcus xanthus 
optimiert worden. 

Elektroporationsbedinaunaen : 

1-3 ug DNA und 40 ul Zellsuspension werden gemischt und in eine 
auf Eis gekuhlte Elektroporationskuvette (0,1 cm) gegeben. Die 
Elektroporation wird bei 400 Q, 25 uF und 0,65kV / cm 
durchgefuhrt . 
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Dxrekt nach der Elektroporation wird 1 m l Tryptonmedium zugegeben 
und in einem 1,5 ml Eppendorf-Reagenzgef aft far 6 h bei 30°C 
geschuttelt. Ausgehend von einer 100%-igen Oberlebensrate der 
Zellen wird eine Verdiinnungsreihe erstellt und 1*10 8 - l*io 4 
Zellen mit 3 ml Trypton-Sof tagar auf Kanamycin haltigen (50 ug / 
ml) Tryptonplatten plattiert. Diese Flatten werden bei 30°C 
inkubiert und nach 5 - 8 Tagen sind die ersten Klone / Mutanten zu 
sehen, die mittels Impfose gepickt wurden. 

1.2 Kultivierung der Transposonmutanten 

Die Mutanten wurden in 96'er Mikrotiter-Platten in 200 pi M7- 
Medium (5 g Probion; 1 g C aCl 2 ; 1 g M gS04 4 ; lg Hefeextrakt; 5 
g Starke; 10 g HEPES ; 0,1 % Vitamin B12 [10 ng / ml] auf 1 1 
Medium; pH7,4) bei 32°C inkubiert und nach 10 Tagen eine Kopie 
der gesamten Bank erstellt. Dafur wurden 50 ul Kultur jeder 
Mutante in neue Mikrotiter-Platten mit 100 pi M7-Medium 
transferiert. Nach weiteren sieben Tagen Inkubation wurde eine 
Kopie bei -80 °C als Dauerkulturen eingefroren. Die verbleibende 
Kopie der Bank wurde extrahiert, der Extrakt mittels 
Toxizitatstest auf generierte Tubulysin Knockout-Mutanten 
untersucht . 

Bei der Identif izierung von Mutanten, die in dieser Analytik 
Veranderungen zum Wildtyp zeigten (keine Zellkern- 
f ragmentierung) , wurden diese aus der Dauerkultur rekultiviert . 
Zur Kontrolle der erzielten Ergebnisse sollten 50 ml M7-Medi 
GroBkulturen der entsprechenden Mutanten erneut getestet werden. 
Bei eventuellen Tubulysin Knockout-Mutanten wurden die Extrakte 
zunachst tiber einen HPLC-Lauf fraktioniert und anschlieliend die 
Fraktionen mittels Toxizitatstest auf Tubulysin untersucht. 
r-jrch die vorhergehende Fraktionierung wird eine Maskierung der 
luouiysin-Wirkung durch Myxothiazol vermieden. Da die beiden 
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Sekundarmetabolite verschiedene Retentionszeiten bei der Elution 
von einer C-14 Saule besitzen, liegen beide im folgenden 
Toxizitatstest in unterschiedlichen Fraktionen vor. 

1 . 3 Toxizitatstest 

Die Mini-Kulturen aus den 96 'Mikrotiterplatten wurden nach 
Kultivierung durch Stickstof f -Begasung auf einem Heizblock bei 
37 °C eingetrocknet . Danach wurden die Zellpellets uber 2 h in 
lOOpl Methanol resuspendiert und jeweils 10 pi fur den folgenden 
Toxizitatstest eingesetzt, urn eine Tubulysin-Produktion der 
jeweiligen Mutante detektieren zu konnen. 

Fur diesen Test werden L929 Zellen in DMEM-Medium (Invitrogen, 
Groningen) bei 37 °C kultiviert und danach mittels Zellschaber 
vorsichtig geerntet. Diese Zellsuspension wird anschliefiend 1 : 
10 mit DMEM verdunnt und 120 pi pro Offnung einer 
96'Mikrotiterplatte verteilt. Diese werden anschliefiend mit den 
10 pi Zellextrakt der einzelnen Transposon-Mutanten versetzt und 
fur funf Tage bei 37 °C inkubiert. Nach dieser Inkubationszeit 
werden die L929 Zellen mikroskopisch auf Zellkernf rakmentierung 
untersucht, was ein Zeichen fur Tubulysineinwirkung ist. Bei 
Zellen, die keine Zellkernf ragmentierung zeigten, wurden die 
entsprechenden Mutanten als mutmaliliche Tubulysin Knockout- 
Mutanten identif iziert . Die Extrakte dieser Mutanten wurden in 
50 ml M7-Medium ( + 1ml Absorber-Har z XAD-16 von Rohm & Haas) 
angezogen und die Zellkerne der L929-Zellen nach durchgef uhrtem 
Toxizitatstest zusatzlich durch eine Anfarbung des Chromosoms 
mittels DAPI-Farbung auf eine Zellkernf ragmentierung bzw. 
Tubulysin-Produktion untersucht. 
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1 . 4 Bestiromung des Integrationsgenortes der Tubulysin Knockout- 
Mu tan ten in der An d48 mariner vermittelten Mutantenbank 
mittels Transposon-recovery 

In der generierten Mutantenbank konnten mittels Toxizitatstest 
funf Mutanten identif iziert werden (MutT176, 524, 781, 794 und 
929), die kein Tubulysin produzierten . Dieses Ergebnis konnte 
nach Rekultivierung der Mutanten aus der Dauerkultur und 
erneuter Analyse bestatigt werden. Um Inf ormationen zu erhalten, 
in welchen Bereich des Genoms das Himarl Element transponiert 
ist, wurde ein Transposon-recovery durchgef iihrt . Bei dieser 
Methodik wird die chromosomale DNA der jeweiligen Mutante mit 
verschiedenen Restriktionsenzymen geschnitten, die nicht 
innerhalb der bekannten magellan4 Sequenz schneiden. Die 
restringierte DNA wird ligiert, nach Transformation in 
DH5ctApir-Zellen wird auf Kanamycin haltigen LB-Platten bei 37 °C 
inkubiert. Auf diesen Platten konnen nur solche E.coli Zellen 
wachsen, die ein Plasmid mit dem magellan4 und somit dem Tn5 
Kanamycin-Resistenzgen enthalten. Solch ein Plasmid ' enthalt an 
den Enden des Transposons chromosomale DNA aus An d48. Diese 
Plasmide k6nnen in den E.coli Zellen DH5a/Xpir replizieren, da 
innerhalb der Transposon Sequenz der oriR6K sitzt. Das 
Transposon wurde so aus dem jeweiligen Genom isoliert und mit 
den Primern K38 8 und K38 9 ansequenziert . Die erhaltenen 
Sequenzen wurden dann gegen die Genbank auf Homlogien zu 
bekannten Genen hin untersucht und zeigten dabei hohe 
Ahnlichkeiten zu nicht ribosomalen Peptidsynthetasen (NRPS) aus 
bekannten Sekundarmetabolit-Biosynthesegenclustern wie denen des 
Myxothiazols, Nostopeptolids und Saframycins. Diese Analysen 
v/aren eindeutige Hinweise dafttr, dass es sich dabei um 
Sequenzfragmente aus dem gesuchten Tubulysin-Gencluster handet. 
Durch Restriktions- und Southernanalysen wurde die Grofie der 
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einzelnen Transposon-Plasmide und ihre relativen 

Integrationsorte zueinander (innerhalb des Genclusters) 
bestimmt . 

1.4.1. Transposon-recovery 

Isolieren chromosomaler DNA nach Standardprotokollen aus 50 ml 
Tryptonmedium-Kultur jeder A. disciformis An d4 8 Mutante. Von 
dieser DNA werden 5 pg fur die folgende Ausklonierung des 
Transposons verwendet, indem zunachst eine Restriktion 
durchgefuhrt wird. Dabei wurden die Enzyme NotI und BamHI 
verwendet, die keine Restriktionsstelle innerhalb des magellanA 
haben und statistisch relativ haufig in GC-reicher DNA 
schneiden sollten . 

Verdau der qenomischen DNA mit NotI bzw. BamRI : 

5 pg DNA 
+ 3 pi 10 x NEB Puffer 
+ 3 pi 100 x BSA 

+ 10 U Restriktionsenzym (BamHI bzw. NotI) 
+ x pi dest. H 2 Q 

30 pi Ansatz fur 3 h bei 37 °C inkubieren => wiederum 10 U 
Enzym zu den Restriktionsansat zen zugeben und weitere 2 h 
bei 37 °C inkubieren. 

Fallen der restringierten DNA und folgende Ligation 

Der gesamte Restriktionsansat z von 30 pi wird mit 1 Vol. 
Chloroform/Phenol versetzt und fur 10 min. zentrif ugiert 
(13.200 rpm; 20°C). Der Oberstand wird in ein neues 
Reaktionsgefaii uberfuhrt und mit 1/10 Vol. 3 M NaOAc und 2,5 
Vol. 100% EtOH versetzt. Zum fallen der DNA wird das 
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ReaktionsgefaJJ far 1 h bei -20 °C inkubiert und anschlieftend far 
30 min. zentrifugiert (13.200 rpm; 4°C) . Der Oberstand wird 
verworfen und das Pellet dreimal mit 70% EtOH gewaschen, wobei 
man jeweils 5 min. zentrifugiert (13.200 rpm; 20°C) . Nach dem 
Verwerfen des Oberstandes wird das Pellet bei 37 °C getrocknet 
und in 15 pi H 2 0 resuspendiert . Fur die anschliefiende Ligation 
werden die gesamten 15 ul der gefallten DNA verwendet . 

Ligationsansatz : 15 pi dna 

+ 4 pi 5 x Ligasepuffer (NEB) 

+ 1 yl NEB Liqase 

20 pi Ansatz uber Nacht bei 16°C inkubiert 
=> wiederum 1 pi Ligase zu den Ligationsansatzen und ON 

bei 16°C inkubieren. 

Elektrotrans form ation der Liqationsansat ze in den E. coli ■ Stamm 
DH5a-Xpir 

1-3 pi der Ligationsansatze und 50 pi DH5a->.pir Zellen werden 
gemischt und in eine auf Eis gekahlte Elektroporationskuvette 
(0,1 cm) gegeben. Die Elektroporation wird bei 200 Q, 25 mF und 
l,25kV / cm durchgefuhrt. Die Zellen werden danach in 1 ml LB- 
Medium (10 g Trypton; 10 g NaCl; 5 g Hefeextrakt auf 1 1 Medium) 
aufgenommen und fur 1 h bei 37 °C inkubiert. Danach werden sie 
auf Kanamycin-haltigen (50 pg / ml) LB-Platten ausplattiert . 
Nach einem Tag Inkubation bei 37 °C konnen die Klone gepickt 
werden. Dabei kSnnen nur Zellen anwachsen, die ein Transposon- 
Plasmid beinhalten und damit eine Tn5-Kan R vermittelte 
Resistenz besitzen. 
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1.5 Sequenzauswertung des Tubulysin Biosynthese-Genclusters aus 
pMutT794/NotI 

Das Transposon-Plasmid pMutT794/AfotI enthalt 52985 bp 
chromosomale DNA aus Angiococcus disciformis An d48. Zusammen 
mit dem Himarl Minitransposon magellan4 (2199 bp) , das bei 
Basenpaar 37317 bp in das Plasmid integriert 1st, wurden 55184 
bp sequenziert. Insgesamt 21760 bp stairatien aus codierenden Genen 
des Tubulysin-Genclusters und sowie 31219 bp weitere codierende 
Gene. Bei diesen ORF's handelt es sich zum Teil urn 
Regulatorgene, die die Expression des Tubulysins beeinflussen 
konnen. Sequenzvergleiche mit den Transposon-Plasmiden der 
anderen Tubulysin Knockout-Mutanten zeigten, class das magellanA 
bei den Mutanten MutT781 (36975bp) und MutT929 (36197 bp) 
innerhalb von 1658 bp zu MutT794 in das Biosynthese-Gencluster 
transponiert sind. 

Auf der Sequenz ist der Start des Tubulysin-Genclusters mit drei 
NRPS-Modulen ( tubA-C J, einem Cyclodeaminase codierendem Gen 
(tubZ) und einem PKS-Modul ( tubD) enthalten. Desweiteren liegen 
innerhalb des Genclusters ein Anionentransporter codierendes Gen 
(ORF1), das zum Austransport des Tubulysins aus der Zelle dient 
und ein weiterer ORF (ORF2). Die grundsatzliche Anordnung der 
Gene, sowie der einzelnen Domanen mit einer N-Methyltransf erase 
innerhalb der Adenylierungsdomanen (A) von tubB und ■ tube, 
entsprechen dem typischen Aufbau des Genclusters und der damit 
verbundenen Tubulysin-Biosynthese . Die Methyltransf erase-Domanen 
(NMT) sind aber im Gegensatz zu bekannten Genclustern-Strukturen 
nicht zwischen der Adenylierungs- und Thiolierungs-Domane (PCP) 
lokalisiert, sondern zwischen A8 und A9 innerhalb der 

Adenylierungs-Domane (A) (hoch konservierte Regionen innerhalb 
der Adenylierungs-D6manen von NRPS; Konz & Marahiel, Chem. 
Biology, 6 (1999) R39 - R48). TubA, codiert eine unvollstandige 
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Kondensationsdomane, die fur die Biosynthese theoretisch nicht 
benotigt wird. Die am Ende der bekannten Sequenz liegende 
Polyketidsynthase (PKS) enthalt eine Ketoacylsynthase- (KS) , 
Acyltransferase- (AT) und Ketoreduktase- (KR) Domane. 

Die restliche Sequenz des Tubulysin-Biosynthesegenclusters wurde 
aus einer Cosmidbank von An d48 (hinterlegt bei der DSMZ) unter 
Standardbedingungen identif iziert . Das auf der ersten Halfte der 
Sequenz endende PKS-Modul (tubD) wird fortgesetzt von den schon 
erwahnten KS, AT und KR-Domanenund beeinhaltet weiterhin eine 
Enoylreduktase (ER) und ein Acyl Carrier Protein (ACP) . In der 
folgenden Sequenz von tujbD wird eine NRPS codiert, die eine 
Heterozyklisierungs- (HC) , Adenylierungs- (A) und Peptidyl 
Carrier Protein- (PCP) Domane tragt. Des weiteren folgen die 
Gene tubE und tubF . Das Gen tubE codiert eine NRPS mit den 
Domanen C, A und PCP. Auf tubF wird eine PKS mit der folgenden 
Domanen-Anordnung codiert: Ketoacylsynthase (KS) , Acyltrans- 
ferase (AT), Ketoreduktase (KR) , C-Methyltransf erase (CMT) , 
Dehydratase (DH) , Enoylreduktase (ER) , Acyl Carrier Protein 
(ACP) und schlieBlich eine Thioesterase, die fur die Abspaltung 
des fertigen Tubulysins in Form einer freien Saure vom 
Multienzymkomplex sorgt. Der Insertionsort des Transposons 
magellanl liegt bei der MutT176 bei Basenpaar 54579 innerhalb 
des Biosynthesegenclusters. Der Insertionsort der Mutante 
MutT524 liegt nicht auf der uns bekannten Gencluster-Sequenz . 
Wir postulieren daher, dass der Insertionsort innerhalb eines 
fur eine Acyltransferase codierenden Genes liegt, das downstream 
vom Tubulysin-Biosynthesegencluster liegt und eine 

posttranslationale Funktion zur Modifikation des Tubulysins hat. 
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Identif izierung der 



Anschlusssecpenz des Tubule 



Biosynthese-Genclusters aus Ancrxocoacus d±sa±form± s 



p sin- 



An d48 



2.1 Identif izierung und Charakterisierung von Cosxuiden, die eine 
iiberlappenede Sequenz downstream zum Tubulysin-Biosynthese- 
Gencluster tragen 



Im vorhergehenden Beispiel wurde beschrieben wie die erste 
Halfte des Tubulysin-Biosynthese-Genclusters zusammen mit 
weiteren an der Biosynthese beteiligten Genen mittels mariner 
basierender Transposon-Mutagenese und anschlieftendem Transposon- 
recovery identif iziert und annotiert wurde. Da innerhalb dieser 
Sequenz sowohl codierende Gene far Monooxigenasen als auch 
Acyltransferasen fehlen, sollte noch weitere Sequenz downstream 
von dieser identif iziert und charakterisiert werden. Innerhalb 
dxeser Sequenz sollten oben angegebene Gene codiert vorliegen 
da sie fur die Biosynthese des Tubulysins erforderlich sind. Dal 
Bxosynthese-Gencluster sollte auf diesem Weg vollstandig 
identif iziert werden. 



Hierfur wurde eine Cosmidbank aus A.disciformis An d48 mittels 
Gigapack II XL packaging-kit (Fa. Stratagene) in E.coli SURE 
erstellt. Innerhalb dieser Bank sollten Cosmide identif iziert 
werden, die einen langeren Oberlapp mit dem Tubulysin- 
Biosynthese-Gencluster downstream von tubF besitzen. Dafur 
warden aus der bekannten Sequenz des Tubulysin-Genclusters zwei 
Primerpaare abgeleitet und die PCR-Amplif ikate als Sonden fur 
die folgende Hybridisierung der Cosmidbank eingesetzt. Das erste 
Primerpaar ASTlslA-lB liefert ein 889 bp groBes DNA-Fragment und 
Ixegt 1 kb vor der ^oti-Restriktionsschnitts telle in tubD Das 
zweite Primerpaar ASTls2A-2B generiert ein 700 bp Fragment, das 
11 kb upstream im bekannten Cluster-Ende entfernt in tu.bC liegt 
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Die PGR wurde bei 54 °C Annealing-Temperatur durchgef uhrt . Durch 
diese Hybridisierung konnten verschiedene Cosmide identif iziert 
werden. Mittels PCR- und Restriktionsanalytik wurden sie auf die 
Grofle ihrer Oberlappung mit der bekannten Cluster-Sequenz 
untersucht. Hierfur wurden wiederum die Primerparchen ASTlslA/B 
und ASTls2A/B bei einer Annealing-Temperatur von 58 °C 

eingesetzt. Bei der Restriktion wurden verschiedene Enzyme in 
Einzel- und Doppelrestriktionen eingesetzt. 

Da die Cosmide F7 und F13 nach den Restriktionsanalysen einen 
ahnlich grofien Uberlapp mit dem ersten Teil des Clusters 
zeigten, tragt eines dieser Cosmide die notigen genetischen 
Inf ormationen, urn die unmittelbar clustergebundenen Gene 
identif izieren zu konnen. 

2.1.1 Southern-Analy tik der Cosmide F7 und F13 

Fur die Identif izierung des richtigen Cosmids wurden zunachst 
Restriktionsenzyme ausgewahlt, die moglichst selten und am Ende 
der bekannten Genclustersequenz schneiden. Die ausgewahlten 
Enzyme waren Ndel und Nsil, die an den Positionen 39306 bp bzw. 
39430 bp schneiden. Des weiteren schneiden beidfe Enzyme nur ein 
weiteres Mai in der bekannten Sequenz. Mit einer generierten 
Sonde (Primerpaar Tls up /down) , die hinter diesen Schnitten direkt 
am Ende der bekannten Cluster-Sequenz bindet, sollte nun die 
Cosmid-Genbank „gescreent" werden. Die Cosmide wurden dafur in 
verschiedenen Doppelrestriktionsansatzen hydrolisiert und auf 
einem Agarosegel (0,8 %) aufgetrennt. Fur die Doppelrestriktion 
wurden neben Ndel und Nsil die Enzyme BamHI, EcoRI, und NotI 
ausgewahlt. Durch die Kombinationen mit EcoRI und NotI sollte 
erreicht werden, dass mittels der Hybridisierung ein Fragment 
identif iziert wird, welches bis an das Ende des jeweiligen 
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Cosmid-Inserts reicht. Falls dieses Fragment fur eine 
nachfolgende Klonierung zu groli sein sollte, wurde ebenfalls 
BamHI verwendet, um eventuell kurzere Fragmente zu erhalten. Die 
Hybridisierung wurde bei 42°C durchgef uhrt , und es wurde unter 
hochst stringenten Bedingungen gewaschen (68°C). 

Das Ergebnis dieser Analytik war, dass in dem EcoRI / Ndel- 
Restriktionsansatz des Cosmids F7 ein 12 kb Fragment detektiert 
wurde. Dieses Fragment enthalt die restliche Seguenz des 
Tubulysin-Genclusters und reicht bis zum Ende der Insert-Sequenz 
des Cosmids. Diese Schlussf olgerung wurde aus den 
Restriktionsanalysen und der charakterisierten Oberlappung mit 
der Tubulysin-Genclustersequenz geschlossen. Das detektierte 
AfotI / Ndel - Fragment ergab eine Grofte von 4,2 kb. Daher muss 
innerhalb der 12 kb Insertsequenz von F7 - zwischen Anfang (Ndel 
- Restriktionsstelle) und Vektor (scos) - mindestens eine 
weitere Notl-Schnittstelle liegen. Somit kann die 
Anschlusssequenz in kleineren Fragmenten (als Ndel / NotI - und 
NotI / Afotl-Fragmente) kloniert und sequenziert werden. Der 
BamHI / Afeil Doppelrestriktionsansatz lieferte insgesamt funf 
Fragmente . 



' 2 ' Co?nid rU F7 g ^ reStl±Chen ^lysin-Genclustersequenz aus 

Das Cosmid F7 wurde in einem Doppelrestriktionsansatz mit den 
Restriktionsendonuoleasen Nsil und EcoRI geschnitten (2 h; 
37 °C). Nach einer Auftrennung des Restriktionsansat zes mittels 
0,8 %igem Agarosegel, wurde die entsprechende Bande aus dem Gel 
ausgeschnitten und mit dem NucleoSpin kit (Fa. Macherei-Nagel) 
extrahiert. Das isolierte Fragment wurde mit NotI 
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nachgeschnitten, urn zu uberprufen, ob die erzielten 
Hybridisierungsergebnisse bestatigt werden. Zusatzlich wurde 
uberpruft ob weitere Notl-Erkennungssequenzen innerhalb der 12 
kb Anschlusssequenz liegen, urn die Anzahl der zu klonierenden 
Teilf ragmente bestimmen zu konnen. Die Restriktion ergab nach 
einer Auftrennung auf einem 0,8%igem Agarosegel ein 4,2 (Nsil / 
Notl-Fragment) und 8 kb groBes Fragment (NotI / Notl-Fragment) . 

Zum einen konnte somit das Hybridisierungsergebnis bestatigt 
werden und beide Fragmente konnten fur eine Klonierung 
eingesetzt werden. Zum anderen wurde bestatigt, dass es sich urn 
das richtige Fragment handelt, das die Sequenz downstream des 
Clusters tragt. 

Kloniert wurden das 12,2 kb grofte Nsil / EcoRI - Fragment, sowie 
die 4,2 kb (Nsil / NotI) und 8 kb (NotI / NotI) groJien Fragmente 
in den Vektor pUC18 . Der Vektor wurde fur die folgende Ligation 
mit PstI / £coRI bzw. PstI / NotI und NotI geschnitten. PstI und 
Nsil besitzen ein kompatibles Schnittmuster , so dass nach 
erfolgreicher Klonierung, diese Schnittstellen nicht mehr 
vorhanden sind. Mittels Hindlll bzw. Ndel und EcoRI kann in 
einer Doppelrestriktion das 12 kb Insert wieder aus dem pUC18 - 
Derivat herausgeschnitten werden. 

Die erhaltenen Klone wurden mittels dieser Restriktionsansatze 
auf ihre Richtigkeit hin uberpruft. Einer dieser Klone (ASpUC12) 
wurde fur eine folgende in vitro Transposition mitels GPS™ - 1 
Genome Priming System (Fa. New England Biolabs Inc ) eingesetzt. 

2.1.3. In vitro Transposition mit GPS - 1™ Genome Priming 
System 
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Durch das GPS - l System sollte mittels einer auf Tn7 
basierenden in vitro Transposition das klonierte Nsil / Not I 
Fragment sequenziert werden. Bei diesem „kit" wird eine TnsABC 
Transposase verwendet, die das Transposon (Transprimer™) 
zufallig in die Zielsequenz inseriert. Durch spezifische 
Sequenzierprimer (PrimerN / PrimerS) , die aus den f lankierenden 
Enden des Transposons „herauslesen«, kdnnen die anliegenden 
Bereiche des DNA-Inserts sequenziert werden. Da das Transposon 
zufallig in die Zielsequenz inseriert wird, kann durch 
Sequenzieren einer bestimmten Anzahl generierter Transposon- 
Mutanten die gesamte Zielsequenz charakterisiert werden. 

Durchfiihrung der in vitro Transposition 

2 pi 10 x GPS-Puffer 
+ 1 pi pGPS 1.1 (vermittelt Kan R ) 

+ 0,2 pi Ziel - DNA (entsprechen 80 ng ASexp7) 
+ 14,8 u l dH 2 0 

18 pi Ansatz 

Dieser Ansatz wird gut durchmischt und mit 1 pi TnsABC 
Transposase versetzt (wiederum durchmischen) . Der gesamten 
Reaktionsansatz wird fur 10 min. bei 37°C inkubiert, damit sich 
die Transposase vor der eigentlichen Reaktion im Reaktionsansatz 
vermischt. Nach der Zugabe von 1 pi „start solution" wird der 
Reaktionsansatz fur eine Stunde bei 37°C inkubiert. In dieser 
Zeit kommt es zum Strangtransf er des Transposons in die Ziel- 
DNA. Danach wird die Reaktion durch Inkubation fur 10 min. bei 
75 °C abgestoppt. Von diesem Ansatz wurden 2 pi in E.coli DH10B 
transformiert und auf Kanamycin-haltigem Medium ausplattiert. 
Insgesamt sind ca. 2000 Klone nach einer Inkubation uber Nacht 
bei 37°C gewachsen. 
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Von diesen Klonen wurden 20 daraufhin uberpriift, in welchem 
Verhaltnis das Transposon in das Insert oder den Vektor 
inseriert ist. Dafur wurden diese Klone in einem 
Doppelrestriktionsansatz mit den Endonukleasen EcoRI und Hindi I I 
hydrolisiert. Die Rest riktionsanalyse ergab, dass bei 75% der 
Klone das Transposon in das Insert inseriert waren. Somit wurden 
192 Klone sequenziert, wodurch eine ca. 12 fache Abdeckung der 
Sequenz erreicht wurde (bei 500 bp Lauflange je Sequenzierung) . 



2.2 Sequenzanalyse und Annotieren der 12 kb Anschluss sequenz 

Die erhaltene Restsequenz des Tubulysin-Biosynthese-Genclusters 
ist 12.219 bp lang und hat eine Oberlappung mit der vorher 
identifizierten Sequenz von 133 bp. Sequenzabschnitte, bei denen 
nur eine einfache Abdeckung vorlag, wurden durch erneute 
Sequenzierungen spezifischer Klone doppelstrangig sequenziert. 
In dieser Sequenz ist von Basenpaar 6416 - 6898 eine 
Aoyltransferase (Position 76.787 - 77.545 bp in Gesamtsequenz ) 
codiert. Die anderen identifizierten ORFs haben ebenfalls eine 
Funktion in der Tubulysin-Biosynthese . Die Gesamtsequenz betragt 
somit 82.8 68 bp. 



3. Identifizierunq einer Tubulvsin uberproduzierenden Mutant, 
ir.r.erhalb der aarin er-Transposon Mutantenbank 

'Jus die Mutanten der. Transposonbank auf weitere auffallige 
Phanotypen gegenilber dem Wildtypen zu untersuchen, wurde eine 
—LC-Analytik durchgef uhrt . Dabei wurde Uberpruft, ob die 
des Tr ansposons in chromosomale Bereiche von 
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Biosynthese-Genclustern anderer, exprimierter Sekundarstof f e 
stattgefunden hat. Bei diesen Vergleichen gegenuber einem 
Extrakt des Wildtypen sollten dann nicht produzierende Mutanten 
der jeweiligen Metabolite identif iziert werden. Zu diesen 
Metaboliten gehbren das Myxothiazol (Gerth et al . 1980 J 
Antibiot (Tokyo) 33 (12) : 1474-1479 und Silakowski et al . 1999 J 
Biol Chem. 274 (52) : 37391-9, Myxochelin (Gerth et al . 1983 J 
Antibiot (Tokyo) 36 ( 9) : 1150-6. und Silakowski et al. 2000 Eur J 
Biochem. 267 (21) : 6476-85) und Angiolam (Kunze et al. 1985, J 
Antibiot (Tokyo) 38 ( 12 ) : 1 64 9-54 ) . 

Bei den Auswertungen der 1.200 HPLC-Laufe fielen Extrakte 
einiger Mutanten auf, bei denen eine erhbhte Myxothiazol- 
Produktion gemessen werden konnte. Zur Kontrolle der erzielten 
Ergebnisse wurden 50 ml M7-Medium Kulturen der entsprechenden 
Mutanten erneut getestet und Zeit kinetiken zur Myxothiazol- 
Produktion uber mehrere Tage gegenuber dem Wildtypen, erstellt. 
Die Ergebnisse dieser Untersuchungen zeigten eine deutlich 
erhbhte Produktion des Myxothiazols in den verschiedenen 
Mutanten bezogen auf den An d4 8 Wildtypen. 

Die Bestimmung des Transposon-Insertionsortes innerhalb der 
jeweiligen Mutante erfolgte mittels „Transposon-recovery" und 
anschlieftender Sequenzierung der f lankierenden Bereiche (siehe 
1.4). Die erhaltenen Sequenzen wurden gegen die Genbank auf 
Homlogien zu bekannten Genen hin untersucht und zeigten hohe 
Ahnlichkeiten zu regulatorischen Elementen / Genen aus 
bakteriellen Organismen. Aufgrund dieser Ergebnisse wurde die 
gesamte Mutantenbank auf Tubulysin uberproduzierende Mutanten 
hin untersucht. Dafur wurde der bestehende Toxizitatstest (siehe 
1.3) optimiert. Durch mehrfache Verdilnnungen des jeweilig 
eingesetzten Mutanten-Extraktes im Toxizitatstest wird eine 
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Ausdunnung des Tubulysins erreicht und somit ist die 
charakteristische Wirkung auf L929 Zellen ab einer bestimmten 
Verdunnung nicht mehr detektierbar . Anhand dieser 
Verdunnungsreihen (aus der gesamten Mutantenbank) warden 
Mutanten identif i ziert , bei denen signifikant hdhere 
Verdunnungen benotigt werden, um keine Wirkung mehr feststellen 
zu kSnnen. Dies bedeutet, dass die jeweiligen Mutante eine 
erhohte Produktion des Tubulysins aufweist. 

Es konnte die Mutante Mutl58 identif iziert werden, die eine 
vierfach erh6hte Tubulysin D-Produktion zeigte. Dieses Ergebnis 
wurde sowohl durch die Anzucht der Mutante in 50 ml Kulturen 
uber HPLC-MS Untersuchungen, als auch mehreren Kinetiken mit 
folgendem, optimiertem Toxizitatstest gegenuber dem Wildtypen 
gezeigt. Mittels des Toxizitatstest wurde sogar eine achtfache 
Uberproduktion von Tubulysinen ermittelt. Dabei wird die 
Gesamtwirkung aller Tubulysin-Derivate detektiert und nicht nur 
die von Tubulysin D. Die Mutante 158 weist somit vollig 
uberraschend noch Oberexpression von weiteren Tubulysinen 
gegenuber dem Wildtypen von A.disciformis auf. Dies war in 
keinster Weise zu erwarten gewesen. Die Ausklonierung des 
genomischen Bereichs unmittelbar an der Insertionsstelle des 
Transposons und die Sequenzierung erfolgte wie unter 1.4 
beschrieben. 

Die Sequenz des betroffenen Gens zeigt hohe Ahnlichkeiten zu 
einer Proteinkinase (aus Stigmatella aurantiaca) , wobei der 
Insertionsort des Transposons die Promotorregion dieses Genes 
betrifft. Ohne auf diesen Wirkmechanismus festgelegt zu werden, 
besitzt dieses Gen eine negativ regulatorische Funktion fur die 
Tubulvsin-Bildung, weshalb eine Inaktivierung des Gens zu einer 
erhohten Produktion f uhrt . Die Gesamtsequenz beinhaltet 2.200 
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bp, wobei die Proteinkinase von 
wird und insgesamt 1.209 bp grofi 
codiert ein Tubulysin-Biosynthese 
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Basenpaar 1.228 - 20 codiert 
ist. Der upstream liegende ORF 
Protein und ist 933 bp grofi . 



Primersequenzen : 

Sequenzierprimer fur das Himarl Mini-Transposon magellanA 
K-388: 5 '-+3' 5 TGG GAA TCA TTT GAA GGT TGG 3 SEQ ID NO: 39 

K-389: 3'-»5' 5 TGT GTT TTT CTT TGT TAG ACC G 3 ' SEQ ID NO: 4 0 

Primerpaar ASTlslA/B wurde abgeleitet aus tubD und ergibt ein 
889 bp Fragment 

ASTlslA 5 CAC CCG GAC CTG CCT GGA TTC 3 SEQ ID NO: 41 

ASTlslB 5 TGC TCG GCT GGC GCT ACT CAC 3 ' SEQ ID NO: 42 

Primerpaar ASTls2A/B wurde abgeleitet aus tube und ergibt ein 
700 bp Fragment 

ASTls2A 5 GCT CCC GGG CCA CGT GGT TGA AGA 3 ' SEQ ID NO: 43 
ASTls2B 5 CCG CGG GCC GTG GCA GTG GTG TA 3 ' SEQ ID NO: 4 4 



Primerpaar Tls up / Tls down wurde abgeleitet aus tubF und ergibt 

ein 125 bp Fragment 

Tls up S TGG CAG CCA GCC CGA GC 3 ' SEQ ID NO: 4 5 

Tls dOMn 5 'CCG CGG GTG CCC TCT CAT C 3 ' SEQ ID NO: 4 6 
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70% Identitat, 85% 
Ahnlichkeit zu Hitzeschock - 
protein GroEL 
[Rhodothermus marinus] 
AAD37976 


34% Identitat, 56% 
Ahnlichkeit zu 
hypothetischem Protein 
[Corynebacterium 
glutamicum] 
NP 616546 


33% Identitat, 47% 
Ahnlichkeit zu konserviert 
hypothetischem Protein 
[Streptomyces coelicolor] 
NP_631315 


30% Identitat, 51% 
Ahnlichkeit zu 

Transkriptionsregulator, TetR 
Familie [Caulobacter 
crescenius] 
NP 420005 


24% Identitat, 40% 
Ahnlichkeit zu Integrase 
[Corynebacterium 
glutamicum] 
NP_601233 


1.644 bp 
58.650 Da 


1.041 bp 
37.818 Da 


999 bp 
36.441 Da 


645 bp 
23.789 Da 


1.137 bp 
41.614 Da 


70.676-72.319 
(SEQ ID NO: 18) 




68.036-69.034 
(SEQ ID NO: 18) 






12.193- 10.550 
(SEQ ID NO: 1) 


12.841 - 13.881 
(SEQ ID NO: 1) 


14.833 - 13.835 
(SEQ IDNO'.l) 


14.942- 15.586 
(SEQ ID NO: 1) 


15.847- 16.983 
(SEQ ID NO: 1) 


ORF12: 

Hitzeschockprotein 
SEQ IDN0.29 


ORFIl: 

Tubulysin Biosynthese 
Protein 

SEQ ID NO: 3 


ORF10: 

Tubulysin Biosynthese 
Protein 

SEQ ID NO: 28 


ORF9: 

Transkriptionsregulator 
SEQ ID NO: 4 


ORF8: 
Integrase 
SEQ ID NO: 5 
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Patentanspriiche 

i. ss-DNA-Molekul, ausgewahlt aus der folgenden Gruppe: 



(i) 



ss-DNA-Molekul mit einer Sequenz gemafi Figur 1; 



(ii) ss-DNA-Molekul, das mit einem ss-DNA-Molektil gemafi (i) 

hinsichtlich seiner Nucleotid-Anzahl oder seiner 
Nucleotid-Sequenz zu jeweils 90, 91, 92, 93, 94, 95 
96, 91, 98, 99 oder 100% homolog ist, jedoch von dem 
ss-DNA-Molekul gemafi (i) hinsichtlich seiner 
Nucleotid-Anzahl und/oder seiner Nucleotid-Sequenz in 
mindestens einem Nucleotid abweicht; und 
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(iii) ss-DNA-Molekiil mit einer Sequenz, die zur Sequenz 

eines ss-DNA-Molekiils gemali (i) oder Tii) komplement&r 
ist . 

2. ds-DNA-Molekiil aus einem ss-DNA-Molekiil gemali Anspruch 1 
und einem dazu komplementaren Strang. 



3. ss-DNA-Molekiil, ausgewahlt aus der folgenden Gruppe: 
(i) ss-DNA-Molekiil mit einer Sequenz der Positionen 3.308 

bis 1 (ORF 16) der Sequenz gemSii Figur 1; 



(ii) ss-DNA-Molekiil mit einer Sequenz der Positionen 4706 
bis 3453 (ORF 15) der Sequenz gemali Figur 1; 

(iii) ss-DNA-Molekiil mit einer Sequenz der Positionen 5719 
bis 7164 (ORF 14) der Sequenz gemali Figur 1; 

(iv) ss-DNA-Molekul mit einer Sequenz der Positionen 9557 
bis 7317 (ORF 13) der Sequenz gemali Figur 1; 

(v) ss-DNA-Molekiil mit einer Sequenz der Positionen 12193 
bis 10550 (ORF 12) der Sequenz gemali Figur 1; 

(vi) ss-DNA-Molekiil mit einer Sequenz der Positionen 12841 
bis 13881 (ORF 11) der Sequenz gemali Figur 1; 

(vii) ss-DNA-Molekul mit einer Sequenz der Positionen 14833 
bis 13835 (ORF 10) der Sequenz gemali Figur 1; 



(viii) ss-DNA-Molekul mit einer Sequenz der Positionen 14942 
bis 15586 (ORF 9) der Sequenz gemali Figur 1; 
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(ix) 



(X) 



(Xi) 



(Xii) 



(xiii) 



(xiv) 



ss-DNA-Molekul rait einer Sequenz der Positionen 15847 
bis 16983 (ORF 8) der Sequenz gemali • Figur 1; 

ss-DNA-Molekul mit einer Sequenz der Positionen 21154 
bis 18809 (ORF 7) der Sequenz gemali Figur 1; 

ss-DNA-Molektil mit einer Sequenz der Positionen 22366 
bis 23532 (ORF 6) der Sequenz gemali Figur 1; 

ss-DNA-Molektil mit einer Sequenz der Positionen 24591 
bis 26513 (ORF 5) der Sequenz gemali Figur 1; 

ss-DNA-Molektil mit einer Sequenz der Positionen 26597 
bis 27517 (ORF 4) der Sequenz gemali Figur 1; 

ss-DNA-Molektil mit einer Sequenz der Positionen 29858 
bis 30400 (ORF 3) der Sequenz gemali Figur 1; 



(xv) 



(xvi) 



(xvii) 



(xviii) 



ss-DNA-Molektil mit einer Sequenz der Positionen 31220 
bis 32392 (TubA) der Sequenz gemali Figur 1; 

ss-DNA-Molekul mit einer Sequenz der Positionen 33056 
bis 32397 (ORF 2). der Sequenz gemali Figur 1; 

ss-DNA-Molektil mit einer Sequenz der Positionen 34195 
bis 33074 (TubZ) der Sequenz gemali Figur 1; 

ss-DNA-Molektil mit einer Sequenz der Positionen 35422 
bis 34205 (ORF 1) der Sequenz gemali Figur 1; 



(xix) 



ss-DNA-Molektil mit einer Sequenz der Positionen 35522 
bis 40147 (TubB) der Sequenz gemali Figur 1; 
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(xx) ss-DNA-Molekiil mit einer Sequenz der Positionen 40144 
bis 48021 (TubC) der Sequenz gemafi Figur 1; 

(xxi) ss-DNA-Molekul mit einer Sequenz der Positionen 48011 
bis 58558 (TubD) der Sequenz gemaA Figur 1; 

(xxii) ss-DNA-Molekul mit einer Sequenz der Positionen 58551 
bis 62096 (TubE) der Sequenz gemafi Figur 1; 

(xxiii) ss-DNA-Molekul mit einer Sequenz der Positionen 62103 
bis 70616 (TubF) der Sequenz gemafi Figur 1; 

(xxiv) ss-DNA-Molekul, das mit einem Molekul gemafi (i), (ii) , 
(iii), (iv), (v), (vi), (vii), (viii), (i X ), (x), 
(xi), (xii), (xiii), (xiv), (xv), (xvi), (xvii), . 
(xviii), (xix), (xx), (xxi), (xxii) oder (xxiii) unter 
stringenten Bedingungen hybridisierbar ist und 
insbesondere dieselbe Anzahl von Basen aufweist; und 

(xxv) ss-DNA-Molekul, das mit einem ss-DNA-Molekul gemafi 
(i), (ii), (iii), (iv), (v), (vi), (vii), (viii), 
(ix), (x), (xi), (xii), (xiii), (xiv), (xv), (xvi), 
(xvii), (xviii), (xix), (xx), (xxi), (xxii) oder 
(xxiii) hinsichtlich seiner Nucleotid-Anzahl oder 
seiner Nucleotid-Sequenz zu jeweils 90, 91, 92, 93, 
94, 95, 96, 97, 98, 99 oder 100% homolog ist, jedoch 
von diesem ss-DNA-Molekul hinsichtlich seiner 
Nucleotid-Anzahl und/oder seiner Nucleotid-Sequenz in 
mindestens einem Nucleotid abweicht; und 
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(xxvi) ss-DNA-Molekul mit einer Sequenz, die zur Sequenz 
eines Molekiils gemSB (i), (ii) , (iii), (iv) , (v) , 
(vi), (vii), (viii), (ix), (X), (xi), (xii), (xiii), 
(xiv) , (xv), (xvi), (xvii), (xviii) , (xix) , (xx), 
(xxi) , (xxii) , (xxiii) , (xxiv) oder (xxv) komplementar 
ist . 



4. ds-DNA-Molekul aus einem ss-DNA-Molekul gemSfi Anspruch 3 
und einem dazu komplementaren Strang. 

5. ss-DNA-Molekul, ausgewahlt aus der folgenden Gruppe: 



(i) ss-DNA-Molekul mit einer Sequenz der Positionen 35747 
bis 367 69 (Domane C des tubB-Gens) der Sequenz gemSB 
Figur 1; 

(ii) ss-DNA-Molekul mit einer Sequenz der Positionen 37184 
bis 39817 (DomSne A des tubB-Gens) der Sequenz gemfift 
Figur 1; 

(iii) ss-DNA-Molekul mit einer Sequenz der Positionen 38369 
bis 39730 (Domane NMT des tubB-Gens) der Sequenz gemSB 
Figur 1; 



(iv) ss-DNA-Molekttl mit einer Sequenz der Positionen 39818 
bis 4 0069 (Domane PCP des tubB-Gens) der Sequenz gemafi 
Figur 1; 

(v) ss-DNA-Molekul mit einer Sequenz der Positionen 4 0372 
bis 41397 (Domane C des tubC-Gens) der Sequenz gemSB 
Figur 1; 
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(vi) ss-DNA-Molektil mit einer Sequenz der Positionen 41824 
bis 43215 (Domane A des tubC-Gens) der Sequenz gemafi 
Figur 1; 

(vii) ss-DNA-Molekul mit einer Sequenz der Positionen 43216 
bis 434 61 (Domane PCP des tubC-Gens) der Sequenz gemafi 
Figur 1; 

(viii) ss-DNA-Molektil mit einer Sequenz der Positionen 43552 
bis 44 574 (Domane C des tubC-Gens) der Sequenz gemafi 
Figur 1; 

ix) ss-DNA-Molektil mit einer Sequenz der Positionen 44980 
bis 4 7 631 (Domane A des tubC-Gens) der Sequenz gemafi 
Figur 1; 

x) ss-DNA-Molektil mit einer Sequenz der Positionen 46153 
bis 4 7547 (Domane NMT des tubC-Gens) der Sequenz gemafi 
Figur 1; 

K±) ss-DNA-Molektil mit einer Sequenz der Positionen 47632 

bis 47868 (Domane PCP des tubC-Gens)" der Sequenz gemafi 
Figur 1; 



(xii) ss-DNA-Molektil mit einer 
bis 4 9321 (Domane KS des 
Figur 1; 

(xiii) ss-DNA-Molektil mit einer 
bis 50584 (Domane AT des 
Figur 1; 



Sequenz der Positionen 48011 
tubD-Gens) der Sequenz gemafi 

Sequenz der Positionen 4 9622 
tubD-Gens) der Sequenz gemafi 
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(xiv) ss-DNA-Molekul mit einer Sequenz der Positionen 51473 
bis 52309 (Domane KR des tubD-Gens) -der Sequenz gemaii 
Figur 1; 

(xv) ss-DNA-Molekul mit einer Sequenz der Positionen 53066 
bis 53980 (Domane ER des tubD-Gens) der Sequenz gem&fc 
Figur 1; 

(xvi) ss-DNA-MolekQl mit einer Sequenz der Positionen 54158 
bis 544 60 (Domane ACP des tubD-Gens) der Sequenz gem£B 
Figur 1; 

(xvii) ss-DNA-Molekul mit einer Sequenz der Positionen 54461 
bis 55870 (Domane HC des tubD-Gens) der Sequenz gemafi 
Figur 1; 

(xviii) ss-DNA-Molekttl mit einer Sequenz der Positionen 56000 
bis 57412 (Domane A des tubD-Gens) der Sequenz gemSB 
Figur 1; 

(xix) ss-DNA-Molekul mit einer Sequenz der Positionen 57413 
bis 57 643 (Domane PCP des tubD-Gens) der Sequenz gemafl 
Figur 1; 

(xx) ss-DNA-Molekul mit einer Sequenz der Positionen 58689 
bis 59714 (Domane C des tubE-Gens) der Sequenz gemafi 
Figur 1; 



(xxi) 



ss-DNA-Molekiil mit einer Sequenz der Positionen 60156 
bis 61697 (Domane A des tubE-Gens) der Sequenz gemaft 
Figur 1; 



WO 2004/022586 




PCT/EP2003/009780 



(xxii) ss-DNA-Molekul mit einer Sequenz der Positionen 61698 
bis 61967 (Domane PCP des tubE-Gens) der Sequenz gemSIi 
Figur 1; 

(xxiii) ss-DNA-Molekul mit einer Sequenz der Positionen 62127 
bis 63320 (Domane KS des tubF-Gens) der Sequenz gemSJJ 
Figur 1; 

(xxiv) ss-DNA-Molektil mit einer Sequenz der Positionen 63711 
bis 64 67 6 (Domane AT des tubF-Gens) der Sequenz gemafi 
Figur 1; 

(xxv) ss-DNA-Molekul mit einer Sequenz der Positionen 64 959 
bis 65882 (Domane KR des tubF-Gens) der Sequenz gemafi 
Figur 1; 

(xxvi) ss-DNA-Molekul mit einer Sequenz der Positionen 65985 
bis 67061 (Domane CMT des tubF-Gens) der Sequenz gemafj 
Figur 1; 

(xxvii) ss-DNA-Molekul mit einer Sequenz der Positionen 67242 
bis 67829 (Domane DH des tubF-Gens) der Sequenz gemafc 
Figur 1; 

(xxviii) ss-DNA-Molekul mit einer Sequenz der Positionen 68247 
bis 69128 (Domane ER des tubF-Gens) der Sequenz gemSB 
Figur 1; 

(xxix) ss-DNA-Molekul mit einer Sequenz der Positionen 69360 
bis 69605 (Domane PCP des tubF-Gens) der Sequenz gem^Ji 
Figur 1; 
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(xxx) ss-DNA-Molekttl mit einer Sequenz der Positionen 69759 
bis 70586 (Domane TE des tubF-Gens) der Sequenz gemaii 
Figur 1; 

(xxxi) ss-DNA-Molekttl, das mit einem Molekul gemaii (i) , (ii) , 
(iii), (iv), (v), (vi), (vii), (viii), (ix), (x) , 
(xi), (xii), (xiii), (xiv) , (xv), (xvi) , (xvii), 
(xviii) , (xiv) , (xx), (xxi), (xxii) , (xxiii) , (xxiv) , 
(xxv), (xxvi), (xxvii), (xxviii) , (xxix) Oder (xxx) 
unter stringenten Bedingungen hybridisierbar ist und 
insbesondere dieselbe Anzahl von Basen aufweist; 



(xxxii) ss-DNA-Molekttl, das mit einem Molekttl gemaii (i) r (ii) , 
(iii), (iv), (v), (vi), (vii), (viii), (ix) , (x) , 
(xi), (xii), (xiii), (xiv), (xv) , (xvi), (xvii) , 
(xviii), (xiv), (xx) , (xxi) , (xxii), (xxiii), (xxiv), 
(xxv) , (xxvi), (xxvii), (xxviii), (xxix) oder (xxx) 
hinsichtlich seiner Nucleotid-Anzahl oder seiner 
Nucleotid-Sequenz zur jeweils 90, 91, 92, 93, 94, 95, 
96, 97, 98, 99 oder 100% homolog ist, jedoch von 
diesem ss-DNA-Molekttl hinsichtlich seiner Nucleotid- 
Anzahl und/oder seiner Nucleotid-Sequenz in mindestens 
einem Nucleotid abweicht; und 



(xxxiii) ss-DNA-Molekul mit einer Sequenz, die zur Sequenz 
eines Molekuls gemaii (i) , (ii) , (iii), (iv) , (v) , 
(vi), (vii), (viii), (ix) , (x) , (xi) , (xii), (xiii), 
(xiv), (xv), (xvi), (xvii), (xviii), (xiv), (xx) , 
(xxi), (xxii), (xxiii), (xxiv), (xxv), (xxvi), 
(xxvii), (xxviii), (xxix), (xxx), (xxxi) oder (xxxii) 
komplementar ist. 
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6. ds-DNA Molekul aus einem ss-DNA-Molekiil gem^Ji Anspruch 5 
und einem dazu komplementarem Strang. 

7. Varianten oder Mutanten, die aus einer Substitution, 
Insertion oder Deletion von Nucleotiden oder einer Inversion von 
Nucleotid-Segmenten eines ss-DNA-Molekttls gemSIi Anspruch 1, 3 
oder 5 oder eines ds-DNA-Mole kills gemali Anspruch 2, 4 oder 6 
resultieren, wobei diese Varianten und Mutanten Enzym-Varianten 
oder Enzym-Mutanten fur die Produktion von Sekundarstof f (en) mit 
fur Tubulysine charakteristischen Eigenschaf ten codieren. 

8. RNA 

(a) mit einer Sequenz entsprechend der eines ss-DNA-Molektils 
gemali einem der Anspruche 1, 3, 5 oder 7 oder 

(b) mit einer Sequenz einer RNA gemSfi (a) , aber in 
Gegenrichtung (anti-sense) , oder 

(c) mit einer Sequenz einer RNA gemaB (a) oder (b) und mit 
einem dazu komplementaren Strang, 

jeweils gegebenenf alls als Element eines rekombinanten Vektors. 

9. Rekombinanter Vektor, insbesondere Expressionsvektor mit 
einem DNA-Molekul gemali einem der Anspruche 1 bis 1. 

10. Zelle, insbesondere zur Expression, in die ein DNA-Mol e kill 
gemafl einem der vorhergehenden Anspruche oder einen Vektor gemali 
Anspruch 8 oder 9 integriert ist. 

11. Zelle nach Anspruch 10, wobei sich die Zelle von 
kultivierbaren Bakterien, insbesondere Myxobakterien, 
vorzugsweise Angiococcus, insbesondere A. disciformis, 
Archangium, insbesondere A. gephyra, Escherichia coli, 
Pseudomonaden oder Actinomyceten herleitet. 
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12. Verwendung eines Vektors gemaft Anspruch 8 oder 9 fur die 
Transformation von Zellen oder Organismen zur transienten oder 
permanenten Expression eines oder mehrerer Proteine 
(Expressionsprodukt (e) ) , das (die) durch eine DNA (ssDNA oder 
dsDNA) des Vektors codiert wird (werden) . 

13. Verwendung einer Zelle gemSJi Anspruch 10 oder 11 zur 
enzymatischen Biosynthese, Mutasynthese oder Partial-Synthese 
eines Tubulysins, insbesondere Tubulysin A, B, C, D, E und/oder 
F. 

14. Expressionsprodukt eines DNA-Molekuls gemaB einem der 
Anspruche 1 bis 7 oder eines Vektors gemaB Anspruch 8 oder 9 
oder einer Zelle gemafi Anspruch 10 oder 11. 

15. Polynucleotid enthaltend eine Sequenz wie in SEQ ID NO: 1, 
18, 33 oder 36 definiert, oder ein Fragment davon. 

16. Polynucleotid nach Anspruch 15, wobei das Fragment eine im 
Sequenzprotokoll definierte CDS ist. 

17. Vektor enthaltend ein Polynucleotid nach Anspruch 15 oder 
16. 

18. Zelle enthaltend einen Vektor nach Anspruch 17. 

19. Polypeptid enthaltend wenigstens eine Sequenz wie in SEQ ID 
NO: 2 bis 17, 19 bis 32, 34, 35, 37 und/oder 38 definiert 
und/oder ein Fragment und/oder Derivat davon. 
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20. Verwendung wenigstens eines Polynucleotids wie in SEQ ID 
NO: 1, 18, 33 und/oder 36 definiert und/oder wenigstens eines 
Fragments davon und/oder wenigstens eines Polypeptids wie in SEQ 
ID NO: 2 bis 17, 19 bis 32, 34, 35, 37 und/oder 38 definiert 
und/oder wenigstens eines Fragments davon zur Herstellung einer 
pharmazeutischen Zusammensetzung zur Behandlung von 
unerwttnschtem Zellwachstums oder unerwtinschter Zellvermehrung in 
einem Individuum. 

21. Verwendung nach Anspruch 20, wobei das unerwunschte 
Zellwachstum oder die unerwunschte Zellvermehrung ein Tumor ist. 

22. Verwendung nach Anspruch 20, wobei das unerwtinschte 
Zellwachstum eine pathogene Infektion ist. 

23. Verwendung nach Anspruch 22, wobei die pathogene Infektion 
eine Mykose, Malaria oder eine parasit&re Erkrankung ist. 

24. Pharmazeutische Zusammensetzung, enthaltend wenigstens ein 
Polynucleotid wie in SEQ ID NO: 1, 18, 33 und/oder 36 definiert 
und/oder wenigstens ein Fragment davon und/oder wenigstens ein 
Polypeptid wie in SEQ ID NO: 2 bis 17, 19 bis 32, 34, 35, 37 
und/oder 38 definiert und/oder wenigstens ein Fragment davon. 

25. Pharmazeutische Zusammensetzung nach Anspruch 24, die 
weiterhin wenigstens einen pharmazeutisch vertrSglichen 
Tragerstoff enthalt. 

26. Verfahren zur Herstellung von Tubulysinen und Tubulysin- 
Biosynthese Proteinen, umfassend die Schritte: 

(a) exprimieren von wenigstens einem Polynucleotid wie in 
SEQ ID NO: 1, 18, 33 und/oder 36 definiert und/oder 
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wenigstens einem Fragment davon und/oder wenigstens 
einem Polypeptid wie in SEQ ID NO: 2 bis 17 , 19 bis 32 , 
34, 35, 37 und/oder 38 definiert und/oder wenigstens 
einem Fragment davon, und 
(b) aufreinigen der Expressionsprodukte . 

27. Verfahren nach Anspruch 26, wobei die Expression in 
prokaryotischen oder eukaryotischen Zellen und/oder durch in 
vitro Expression erfolgt. 

28. Verfahren zum Auffinden von Genen, die an der Biosynthese 
von Tubulysinen beteiligt sind, umfassend die Schritte: 

(a) hybridisieren von wenigstens einem Polynucleotici wie in 
SEQ ID NO: 1, 18, 33 und/oder 36 definiert und/<pder 
wenigstens einem Fragment davon mit DNA, RNA, und/oder 
cDNA einer Spezies, die nicht mit Angiococcus 

disci formis identisch ist, und 

(b) isolieren und charakterisieren der hybridisierten DNA, 
RNA, und/oder cDNA. 

29. Kit zur Herstellung von Tubulysinen, enthaltend: 

(a) wenigstens ein Polynucleotid nach Anspruch 15 oder 16 
cmd/oder wenigstens einen Vektor nach Anspruch 17 
oder 

(b) geeignete Medien und Puffer zur Vermehrung von Zellen, 
die eine Expression des Polynucleotids und/oder Vektors 
erlauben und 

(c) geeignete Mittel zum Aufreinigen des/der 
Expressionsprodukte (s) . 
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30. Verwendung einer Zusammenset zung enthaltend wenigstens ein 
Polypeptid wie in SEQ ID NO: 2 bis 17, 19 bis 32, 34, 35, 37 
und/oder 38 definiert und/oder wenigstens ein biologisch aktives 
Fragment oder Derivat davon als Desinf ektionsmittel . 

31. Verwendung nach Anspruch 30, wobei die Zusammenset zung 
fliissig oder pulverf ormig ist. 



32. Desinf ektionsmittel wie in Anspruch 30 oder 31 definiert. 
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Figur 1 sequence listing 

<110> Gesellschaft fuer Biotechnologische Forschung mbH 

<120> Tubulysin-Biosynthesegene 

<130> 13289 

<160> 1 

<170> Patentln version 3.1 

<210> 1 

<211> 70782 

<212> DNA 

<213> Angiococcus discif ormis 



<400> 1 



gcggccgcct 


tcttcaccgc 


gggggcctcg 


gcctcctcgg 


tggtcggggc 


gggcgccagc 


60 


ttgtcgccgt 


gcaccttgcc 


gggagcggtc 


gccttcttct 


tcgaggcgga 


gcgcacgggc 


120 


tcgccctgcg 


tcacctcgcg 


cgtgggcagc 


atgtccccgt 


gggccttcac 


cgggccggac 


180 


tcaccctcct 


ccgcctcggc 


gccctcggcc 


ggcgtctcgc 


gggtgtcgcc 


accctccttg 


240 


gcccggttga 


actcgcgcat 


cgcgtcgtcc 


accagcccca 


tggtcatgta 


cgccacgccg 


300 


aggtcatggt 


ggtcggatgg 


cgacaggccc 


tccttcgtgc 


cctcgcgcag 


cttcgcgagc 


360 


gcgtcctgca 


cctgcgtgtc 


cccggcctcg 


gacgcggtgc 


cggtctcgcc 


cagttcaccc 


420 


ttcagctcct 


ggcccaggtc 


cacgccgccc 


ttctcttcct 


gattcggggc 


gaccttcacc 


480 


tgggccgggg 


cggaggggga 


ggcaccctct 


tccgcggtgg 


gggtgctgct 


ctcgggagtg 


540 


gtgctctcgg 


acggcggcgt 


ctctggcatg 


gcgggctccg 


gggcaatccg 


ctgcaggtgg 


600 


tcctgaagct 


tggccttgcg 


ctccttcagc 


tcctcgacgc 


gggcgcggtc 


cttctccacc 


660 


acgtccgggg 


gcgccttggc 


cacgaagttg 


gggttctcca 


gcttgcgcag 


cacgccggcc 


720 


atctcctgct 


cggcgcgggc 


aatctccttc 


ttcaaccgct 


cgcgctccgc 


gtccaggtcg 


780 


acgaggcccg 


ccagcgggac 


gtagatctcc 


aggttggagc 


ccacgaaggc 


ggcggcctgc 


840 


ggcggcttgg 


cgcccggcgc 


tcccaccgtc 


acctcggaca 


ggccggccag 


cggcatgagg 


900 


tagccgcgcc 


agcgctccag 


cagctcgcgc 


gtgcgcacgt 


ccgcgctctg 


caccaccgcc 


960 


ttcaccttgg 


tggcgggcga 


caggttgctc 


tcgccgcgga 


tggtgcgcag 


gccctcgatg 


1020 


gcggcgatga 


ccggcgccat 


ctcgccttcc 


gccgcctcgt 


ccacgtgcgc 


cgcgtccggc 


1080 


tccgggtacg 


gggcaatcat 


gatgctgtcc 


gtcggccggg 


ccatcggcag 


cttctgccag 


1140 


atctcctcgg 


tgatgaacgg 


catgaacggg 


tgcagcagcc 


gcaggatgcg 


gtccaggcag 


1200 




cccggcgcgt 


ggagtccttc 


gccgccgcgt 


cctcgccgta 


cagcgagccc 


1260 




cgatgtacca 


gtcgcagaac 


tccgcccaga 


ggaactggta 


gagcgtggag 


1320 


gcggcctcgc 


cgaagctgta 


cgactccagc 


agcgcgcgcg 


cctccgtggt 


ggcgcgctgg 


1380 
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2/38 



aggcgggaga 


gaatccagcg 


gtccgccagc 


gtcagtgtgc tccggtccag 


f-r f-r f~r r+Ct(*t 

egggcuy y 


1440 


tccatccgga 


agtcgcccat 


gttcatcagg 


acqaaqcqqc 


tggcgttcca 


cagcttgttg 


1 SOO 

1 J 


ccgaaggcct 


tgtagccctc 


gagccggtcc 


atcgacagct 


tgatgtcgcg 


gccctgctgg 


1 SfiO 


gtgagcgagg 


cgagcgtgaa 


gcgcagcgcg 


tccgcgccga 


aeggeggcat 


gcccugcggg 


1 fi?0 

J. U ^. vj 


aaccggttct 


tcagcgtcgg 


ctggagcttg 


teggcegggg 


cgccgaggac 


gatgtccagg 




gggtcgatga 


cgttcccctt 


cgtcttggac 


atcttctggc 


ccttctcgtc 


t*t /** a /""• ^ 4~ 

yCguaCCoLU 


1740 


gcgtgcaggt 


acacggtgcg 


gaagggcaca 


tcccccatga 


agtggatgcc 


4- «-» -5 4* !s 4" 


i ft no 


atccgggcga 


cccagaagaa 


gatgatgfccg 


fcggcccgtct 


ecatgaegga 


y**- 4— — w >»y- ^ *Sk 

cgtcgggtag 


i ft fin 


aaggtcttca 


gctccggcgt 


ctcacgcggc 


cagcccagcg 


tqqaqaacqq 

3 J 3 3 3 


ccacaggccg 




gacgagaacc 


aggtgtccag 


cacgtccggg 


tcctggatga 


aqgacqcqcc 

3 3 3 3 


gccgcacttc 


1 QftO 


gggcaggact 


tcggctgctc 


gcgcgcgacg 


atgggctccg 


cqcqcgcqaa 

3 3 3 


yuccaccccy 


<L \J *± \J 


cccaccttca 


ccgtgggggc 


gtccagcggc 


aggtccgtgt 


cgtcgccctg 


ccgcgggctg 


9i nn 


cacgaggtgc 


agtagfcacgc 


gggaat ctgg 


tggccccacc 


acagctggcg 

— 3 3 3 3 


gCLgacycac 


91 fin 


cagtcgtgga 


tgttgcgcat 


ccagtggaag 


aacgtgttcg 


tccaggactc 


ggggacgaac 


9 9 9 n 


ttcgtgcggc 


cctgctccac 


cgcctcgatg 




ccagcggttc 


aatctugaty 


9 9 ft n 


aaccactgcg 


gagacaggcg 


tggctccacc 


a ccatcrcf ccf c 


agegctggea 


ggtgcccacg 


Zofl U 


ttcagcatgt 


ggggctcttc 


cttctccagc 


agcccctgct 


ccgtgaggtc 


cgccagcacc 


9 /i on 


tgcttgcgcg 


cctcgaagcg 


atccatgccg 


gcgtacttgc 


cggtctcctt 


cgtcatccgg 


9 yi ^ n 


gccgcttcgt 


ccaggatggt 


gagcatgggc 


agcttgtgcc 


gcaggcccgt 


ctggtagt eg 




ttgaagtcgt 


gcgccggcgt 


caccttcacc 


acgccggtgc 


egaacttegg 


gtccaccagc 


ocqn 


tccgcgtccg 


cgatgatggg 


aat ct eg egg 


teggtcageg gcagcttcac 


gctcttgccg 


ZOIU 


gccagcccca 


ggtagcgctc 


gt cctccggg 


tggatggcca 


ccgcggtgtc 


gcccagcatc 


9 "7 o n 


gtctccgggc 


gtgtggtggc 


*— 1 --3 i^f #-t +- rf-C *T 

caegg tgagc 


gtgeggtege tgtccttgac ggggtagegg 


9 n fin 


atgtgccaga 


tggagccctt 


ctt cgactcc 


tcgtgctcca 


cctccaagtc 


getgagggeg 


9 Q9 n 


gtgcggcacg 


aggggcacca 


gttgatgagc 


ttctgggccc 


ggtacatcag 


gccctcttcg 


O O Q A 


tacaggcgca 


cgaagacctc 


gcgcacggcg 


geggaggact 


gctcgtccat 
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ggcggcagca cgatggagaa 
cgctccatcc agcaggcgta 
agttcagtgg tgtcggtcat 
ggtcttggag agaggttaag 
agccgctcga gctcctcacg 
atcttctcga tgacctcgcg 
ccatccgcgg cggtcggacg 
gcggccgtcg gggggccgcc 
agcgagtcct cgatgctgat 
aggccgaacg ggtccctcgc 
agcggcgcgg cgccgcccgg 
ggacgggcca tgccggcgcc 
tgcccggcgc ggacgcaccc 
ggcggcatgc ccgggcccgg 
ggccccggcg ggcgggccat 
gcgcccggag cacccggagg 
ggcggcattc caggccccgg 
cccggaggac gcggcacgcc 
ggcgcgcccg gctgcgggcc 
gggcgcacct gcgtggccgc 
ttcaccttgt cgagcagcac 
gcggcgcggg cgcggttctc 
atgccctggg tggccgggtc 
ggcatcatca catcggcgag 
tccagcccgt tgtccaccgc 
accttgcgga tggtgagcga 
ctcggggggc gtcccacccc 
cgcggccctg tatcaagcgg 
tgcgctgcag gtccaggaac 
tggtggtctc cggctcgaag 
cgttccggcc ggccagctcc 
gcagcgcggg gaccgagcag 



ggcgggcttg 
ccagcgggcc 
tgcggacggc 
gcgaggaacg 
gatgatggtc 
ggaggccttc 
ggccgtggcc 
catgtccagg 
ggaaggctgg 
gcgggccgcc 
aggacgcggg 
ggcccacccg 
ggcggaggcg 
aggacgcggc 
gcccggtccc 
cggcattcca 
aggacgcggc 
cggctgggtg 
cggaggcacc 
ggaggccggc 
ctggctctcg 
gtcgaaggcc 
gctcttcagg 
gacgacgtcc 
ggtcacctga 
gtcatcggcg 
cctgaattcg 
ggcagcttac 
agcacggagc 
tgctgcagca 
gtgaggatga 
atgggccaca 
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tcgacctcgg 
cagccccggg 
cgtcagtgct 
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gagagcgcct 
ggagcggcgg 
gagatctcct 
gctgcctgcg 
ggctgggtcg 
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gcgcccggag 
ggcggcattc 
ggcccgggag 
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ccctgcggaa 
ggcgcgggag 
atggtgttgg 
aagggcttgg 
tcgaaggtcc 
gcctcgcaga 
gggcgcagct 
aagtcctcgg 
accagcagat 
ttggagattc 
gtttcgcgct 
ctgtcaaccc 
cgccgagcac 
agagggcctc 
gcacctgggc 
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cctcggcgcg gaagtagttc 3300 
tgggctcgta ggccttggac 3360 
ggaaccgggg ccggcgggaa 3420 
gcgtctcccg gtccttgatg 3480 
gcggtaccac ctcccaggca 3540 
cgcgcagcag cgcctcgcca 3600 
tcggcgccac cgcggtcgga 3660 
cggcaccgct gggctcgggc 372 0 
gcgccgccgt gggagcgccc 3780 
cgccacccgg cgcgggaggc 3840 
ggctgccctg gggaatcccc 3900 

cacgcccgga cgcggcggca 3960 

gacgcgcggc ggcgcccgga 4020 

cacccggcgg cggcattcca 4 08 0 

caggccccgg aggacgcggc 414 0 

gacgggccat gcccggcccc 4200 

cacccggcgg catgcccggg 4260 

tgcccggacg ggcggccgtg 4320 

tggccggctg ggcggtgggc 4380 

acttctgccc caccagcgcc 4440 

cgatgtggtc atcggcgcgg 4500 

ccgccagcag caccacgggg 4560 

cctcgtagcc gctcttgccc 4620 

cgcgggtacg ggagatggcg 4 68 0 

tcgcgaagat catcccaatc 4740 

tcttgggcat cggttcgggc 4800 

ctgccgcgaa aaacccaggt 4860 

ccagcacgat caagaaaaca 4920 

gggggcacgg aaggctccaa 4980 

ccgggtggcc gtcagcccca 5040 

gggtaccggg ggggagccca 5100 

gtgcgggaag atgccggcca 5160 
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ctgcacgggc 


aggacgctga 
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tgaccaccgc gccctcgacg ccggcgttgg aggcgatctt ccgcagcggc tcctggagcg 108 60 

cgcggcggat gatctccacg ccgaagtcct gctcaccgcc cagcttcagc ttctccagcg 10920 

cgggcaggca gcgcaggaag gccacgccgc cgccagggac gatgccctcc tcgacggccg 10980 

cgcgagtcgc atgcagcgcg tcctccacgc gggccttctt ctccttcatc tcggtctcgg 11040 

tcgcggcgcc cacgttgatg acggccacgc cgcccaccag cttggccagc cgctcctgga 11100 

gcttctcgcg gtcgtagtcg ctggtgacgg tgtcgatctg cgagcggatg agcttgatgc 11160 

ggccctcgat ctcgctcttg gtgccggcgc cgtcgacgat ggtggtgttg tccttgtcca 11220 

ccgtgatgcg cttggagcgg cccaggtcgt tgagcgtcag gttctcgtac ttgtggccca 11280 

gctcctcgct gacgaccatg ccgcccgtca gggtggcgat gtccttcagc atctccttgc 11340 

ggcggtcacc gaagcccggc gccttcaccg cggccacgtt cagcacgccg cggatcttgt 11400 

tgaccaccag ggtggccagc gcctcgccct cgatgtcgtc ggcgatgatg agcagcggct 114 60 

tcccggagcg cgccacctgc tccaggatgg gaatcatgtc ctgcatcgac gagaccttct 11520 

tctcgctgat gaggatgaag gggtcgtcca tgacgacctc catgcgctcg cggttcgtca 1158 0 

cgaagtacgg agagacgtag ccgcggtcga actgcatgcc ctccaccacg tcgaggttgg 11640 

tctccaggcc cttggcctcc tcgacggtga tgacgccctc cttgcccacc ttctccatcg 11700 

cgtccgcgat gatggagccg atggtctcat caccgttggc ggagatggtg cccacctggg 11760 

tgatggcctt cttgtccgcc gtgggcttgg acagcttctt cagctcctcc acgacgacct 11820 

cgacggcctt gtcgatgccg cgcttgaggt ccatggggct gtggccggcg gccaccagct 11880 

tgaggccctc ctcgtagatg gcacgcgcca gcaccgtggc ggtcgtggtg ccgtcgccgg 1194 0 

ccttgtcgga ggtcttcgac gcgacctcct tcaccatctg cgcgcccatg ttctcgaact 12000 

tgttgtcgag gtcgatctcc ttggcgacgg tgacgccgtc cttggtgatc gtgggcgaac 12060 

caaagctctt ctcgatgacc acgttgcggc ccttggggcc cagggtcacc gccacggcat 12120 

ccgcgagggt ccggacaccg cgaaggatgg cctcacgcgc ggactggtgg aagaaaatct 12180 

ccttcgctgc catcgcaacc tcctgaaatt actgggggat ttaactgtga cgacgctggc 1224 0 

cgttagcact cggccatggc gagcgccaac ataagggcca gcgaaatgat gtcaaccagg 12300 

aagggcgtgc ctgtgggcga gcggacgggg ccagggcccc tcggggccgt ctggccgggg 12360 

ggctaetcgg cgaagcgcac gaggttgagc agctgaccca tgtccagggg tttgcgcagc 12420 

gtcagcgcca cgcccttgcg caggccccgg tcggtgacgc cctcgtcctg ggcgcccgtc 12480 

atcagcagca cgggaatggc ctggaagcgg gggtcggcct tgagcatctc cgtgaagtgg 1254 0 

atgccgtcca ggtggggcat gaggtaatcg gtgatgacca tgcccacctg gttgcgctcc 12 600 

aggatgtcga gcgcctgcag ggcatccgcc gccgagtaga ccgtgtagcc atgcatctcc 12660 

aggaagcgag agagcatggt gcacagctcg agatcgtcat cgacgacaag gatgttcacg 12720 
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caacggtggc tccggccgag caagcaagga gcctgtacgg atatagctga agttctcgat 18420 
catggacttg tactcaaccg gtctccgtcg gaggccgtgg atggtgtcga aggtgtggat 18480 
cgagccggtg gccaccagtc cgtccctatt ctgaatgtat cgaatctctc cgaggtcctt 18540 
ccgggcgctc gccctgcccg cctccgtgcc ggtgaaggcg aacgggaggc gcttggcccg 18600 
gcacctgagc cccgttggac gccatggatg gcccggaagg gtcacagcac cggctacttc 18660 
agccctttac tcgtgctgcg gcgtcatcgt cctgacttcc ccatagcctt ccatcccttc 18720 
cgatccttcc acctctctct gcaaccaggg ggagaccgaa gcagacagca tggaaggcat 18780 
cgccgcagtc ggaacgcatc cgttccaatc acgggtgcgc aaggtccatc cgagcgccag 18840 
cggcgatgct ctccatcacc ccgtcgatct cggtctccca gcgtttcacc gtgacccggt 18 900 
acttgattcc atcgctccgc ttccagcccg tgttctcttc accggccaac tccttcatgc 18960 
gttcaccgaa cttcgtgagc gagaacgcca tccggtcacg gccctggtac ttcaagtcct 19020 
cgacgtagtg gccgtacagc gccgtggcct tgatccactg gtcccgctgc gcgcaccggc 19080 
gctcgaagaa atcttccacc gagtccttgg ctcgaatctc atcctggcgg gcctggacct 19140 
gcgccaattg gccgaggagc gggctctccg ccttctcgtc cacgctcgtc cacatggcca 19200 
ggaagtcgat ctgctccagt gccgtccagt tcatctgggg cagacagtcc agttgataga 19260 
agcgtcgcat tcccgtggga tcgacgacga tctccttcac ctgcttgttg gacgcaccga 19320 
tgaaggtggc gttgttggtg ccggtctgga gcttctggac accgagcacc cggtactcga 19380 
tccgctcggc ggtgatgcgg ttcttgagca cgtcagacgc tacccgctct gccttcgcca 19440 
tttcgtcgaa gaacatgccg tagcagcgag tcaaccggaa ctgctggcgc tcatcggtca 19500 
ggaactggag atcggcgggg aacccaacga gatcgccaag aggctgtagc aggctcttga 19560 
tggccgtagt cttcccgccc cgctgtttgc cgacgaagat cggcatcagg tggtccttga 19620 
ccggcttccc gaacaacttc cgcttcacct gccaaatgaa gtgaagaacg accacgagat 19680 
ctaccgggtc aaggcgtccc gtggttgctc gaatgaactc ctcggcgtca cctcggcgtc 19740 
cgtcctcgcg gaacttcaac ttctcacgat actgctcgac gatcccccgg cgttggtccg 19800 
cctcccacag gcgcacggcg ttaccgagat ttcgcaogct gatgcatgtg cgctcgttcg 19860 
tggactcgat ctcgatctgg gacaagaact cggacgcagg aaacatccgg cccccaatct 19920 
cra-^ccatc yctcgcgtac acgaatgtga cgccgttctt ctccatccag tgttgaaccc 19980 
atgtgcgcgg ctogaagcgg ttctcgaccc tcggcaacgc ctgcaccaat cggtcgatga 20040 
cctcgggctt gcagaccttc ttcctcttca aggtcccgac ccacttcttg aagatgccat 20100 
y< -. .-ggtgga gaaccaaacc gagcaccaga cgagagcgcg cttgatctcg atctccttca 20160 
cc^-accgng cgcctcatgc tgttccacga gccagtcaca gagtgccgaa ggaagcgccg 20220 
.;y._ y ,../. ~ va ygccaagtac tgcacgatca ggaactgacg agcccatgct tccgtcttgt 20280 
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gttccactca 


gcctcgttgc ggctggattc 


aacctcgttc 


gccttcgagt 


21120 


cgggcagtgc 


cttgtccgag 


tcctcatgtc ccattgggta 


cacctctccc 


atcaacgccg 


21180 


aagccgtgct 


tcgacgcctt 


gaaatgcagg gtggctcttg 


aacttgtatc 


gattctggct 


21240 


acgtcgttgc 


ggcgccctgg 


agcggagccg ggttcggtgt 


ggctacaagg 


aatatatcga 


21300 


atgtgtctgc 


gcacctcgga 


cggtgcgccc cgagggccac 


gggcatcaac 


gccgctgcgc 


21360 


gatcctgcgc 


cgtggacggc 


caggggattc gcgacgtaca 


ccggacggca 


accttccgtg 


21420 


tatgagccga 


ggcacccgct 


ggagcacgga gtacctggac 


gcgatgctcg 


cggcctggga 


21480 


gaggtccggg 


ctgacctcgg 


agcagttcac ggccacggcc 


gcgccagggc 


tgcctgcgcg 


21540 


taccctacgg 


ctgcaccgcc 


agcgaaaaga gaaggagatc 


cacgcagagc 


ttcagcggct 


21600 


tcgggatgaa 


aacgcgacgc 


tccggtcgat gctcgaaagg agcgcgttgg tccctggtgg 


21660 


ttgccgggcg 


gcaaccgtca 


gcgaccccaa cggccacggc 


attggagatc 


gccatgtggc 


21720 


aaacggaacc 


gaagcccctt 


cgccgaccgc gaccggcccg 


ccagcattgg 


agccggtaca 


21780 


gccgcgcgtg gtgccgcagg accaggttgc cggaccctcc 


cccggtgatc 


acgatggtcg 


21840 


tgtgctcgta 


ggggaaatca 


ccgagccggt gcccccgaag 


agaccggcgt 


tctcctggga 


21900 


ttgatggcct 


cgcgcctgct 


ggccgatgtc ggacgccatc 


taccgctggt 


cttggtccgc 


219 oU 


gagcgccttc 


aagagcttct 


tgatctcggc cgaggacatc gtgcggatgc ggtccacgag 


22020 


gcgttgccgc 


tcgtcatcct 


cgggaaacgt caggagatcc 


agcggcaaca 


ctcccaggtg 


22080 


gtccgccagc 


accttcaacg 


tgcgaaggcc aagctcctgc 


ggccagtgct tagaggtggg 


22140 
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actcagggca 


cggagcagag 


gcagagcgcg 


agcaccaggc 


aggcgctccg 


ccgcgctcac 


22200 


cctgccgagc 


cctgtggtcg 


cttgtaatac 


cctttgggtt 


tatgaatagt 


agacctcgga 


22260 


ccgcgagagc 


caccggcccc 


gtccccctca 


gccgaggccc 


caccagggag 


ccggaaggcc 


22320 


cgagccagtg 


cccgccgcgc 


gtcccttccc 


acctggaggc 


cgcacatgac 


cgatgacagc 


22380 


acgccgaccg 


acgaaacgca 


gatcgaggcc 


ccgacgccgg 


agaacacctc 


cgacctgaag 


22440 


ggcgcccgcc 


cccgactccg 


gttcaccatc 


gagaccacca 


cgtacgagac 


cctccgcacg 


22500 


ctggagcggc 


ggggtaacgg 


cgaggtggtg 


ctcctggccg 


agcgtcacct 


cccccatgga 


22560 


ctcgccgggc 


tcgtcaccat 


caagcgcctg 


cgcaatcccg 


tcactttcga 


gcgctgccag 


22620 


cggctgatcg 


aggaggtcca 


gctctccttc 


cgcctccacc 


acccggccat 


cgcccaggtc 


22680 


catcacctga 


aaatccatgc 


cgaccggccg 


cacgtcatcg 


ccgagtacgt 


ggacggcccc 


22740 


acgctggaca 


ccatcatcag 


cctcgccacc 


atgcgcgaga 


agccgctctc 


cgcgcccttc 


22800 


gccctctaca 


tcgccgccga 


ggtggccgac 


gccctccacc 


acgcgcacac 


gctgagagat 


22860 


tcggagaacc 


ggccgctggg 


catcatccac 


cgcgacgtgg 


cgccccggaa 


catccgcgtg 


22920 


gcccgcagcg 


gcgaggtgaa 


ggtgacggac 


ttcggcgcta 


cctactcgct 


catggtgggc 


22980 


agggaggaga 


cgccgggcct 


cctgctcaag 


ggcgacgtgg 


cgtacgcctc 


gcccgagtac 


23040 


ctgaaccgta 


agcccatgga 


tggccggtcc 


gacatcttct 


cgctgggcct 


cgtcctcatg 


23100 


gagatgctga 


cgtgcaagca 


cctcttcgac 


gtggaggacg 


aaaaggcccc 


caacgccacc 


23160 


gtggacgtga 


agacggagga 


gacgccctcg 


gtgcccctca 


cgcagatgat 


cgcgctcgtc 


23220 


aaccgctacc 


gccccgagga 


cgtggagaac 


gcgatggcgg 


gcctgccgga 


cgcgctcaag 


23280 


gccatcgtcc 


acaaggccct 


ccagcgaaag 


ctctccgagc 


gctacgccac 


ggccgccgag 


23340 


atgcgcgatg 


cgctgcgagc 


ggcgctcgcg 


gcggagtcac 


agcccttcgg 


ccggaaggag 


23400 


gcgagcgagg 


agctggcgcg 


gatgctgtcg 


gaggcgtccg tcctgcgcga 


ccgggtggaa 


23460 


ctggacgaag 


aaggcatctt 


ccccgagggc 


ctggacgccg 


acgagccgac 


gcccgcaccg 


23520 


aacgaagagt 


gagaacgggc 


ggcggtgcat 


tccttgaagc 


aagcgggccc 


cggtgtgggc 


23580 


ccgcgtcgtt 


tcagcgcaac 


aacacctggg 


ccaccttcac 


gagggcatcg 


agctggtcct 


23640 


cgtccatctt 


gcgcgccagt 


ccggtgagct 


ggcgcagctt 


gggcgccccg 


ccctgccctc 


23700 


gctccccctit 


gcccgacttc 


gtcccctcct 


ccgcgtccgc 


gatccccagc 


aactcgtccg 


23760 


aggagatgcg 


cagcaccgag 


cacatccgca 


gcagcgtctg 


gacgctgggc 


agcatcttcc 


23820 


cgcgctccag 


gcggctgtag 


accatgtgcg 


cgaggcccag 


cttctcggcc 


acctccgcct 


23880 


gcgtgagccc 


gagctgcgtc 


cgggcctcac 


gggcggcact 


tccaatacgg 


gtcgccagct 


23940 


cttcattcat 


gcgtcggggt 


accaggacac 


cgagaaccag gtttggcgcg 


ggcaggaagg 


24000 


cccaagcctc 


ggggacgccc 


cttcctgccg 


catctcggcg gccatgtcga 


tccacgagca 


24060 
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aaaccccatc 


a ccr ccr a ccr c c 


cctatt ggee 


ggacccatcg 


a a act ccrggaa 

^3333 3 3 3 


cctccagggc 


24120 


ca crcr ccaccc 


a crcr a t era a crt 

d y y d w y t* y v» 


c ca ccr crcr era c 

^— * V»f Sm*. \prf" y y 


aeggaggaca 


ccgcacaggc 


ccaccagcgt 


24180 


crcr cr era acre t cr 


nacacfcaacc 

y y * d y w w 


cr crcca ccr etc 


caggeggctg 


aa caeca gcg 


tcaacacQtcr 

3 3 3 3 


24240 


crater erect" r*n 


crcca cctcrercr 


cctcr ccracacf 


gcccgcccga 


taccacrcfccfa 

v- y y y "3 — -y y 


cgeggacatt 


24300 


crcrt t cccra t e 


tcr c t cr crcra era 

** y ^— y y y y *■ 


crct t ccr cert a 


cateracr era era 


cctacrcracraa 


aoaqqagaat 


24360 


crcccrccrteicrei 


ccr crca ctcccr 

uyy • d * — ■ v— \_* y 


a cr catacacrc 


tea cca crcr tt 


caaccratacrt 


atccgctcaa 


24420 


crcrtttaccct 


tctcccoccjc 


ca ct t ccr ccr cr 


crcr cr era a ct ccr 


tacgagatcc 


getaceggtt 


24480 


gagagcatgt 


ccaggaacgc 


cgaccccaca 


cgccccgccg 


aaeraacracrcra 


cacttgagcc 


24540 


eg cgt ct gca 


tcccgtcgtc 


cccccgggca 


eggaca t egg 


egggtacctg 


a t cr cr a cr cr a a a 

y ^yyuyyayn 


24600 


agctgggagc 


egggggett c 


ggtgccgtgt 


accgcgcccg 


ercCTtercracrcfcr 

y *— y y w.y y y 


egget ctacg 


24660 


cgctcaagct 


catcccact c 




ctaaataaac 

^3 y i-yyy\_ 


egagegegaa 


gtggccatcc 


24720 


t cctccggct 


caagcactcc 


a at ct era tcr c 


gcatccgtgg 


gcatggacag 


tggccggatg 


24780 


aggegect ca 


at ccttcttc 


at cert caterer 


actaccftcrcra 


a cr cr cr c cr c c cr a 


ttggacgta t 


24840 


cfcrcfccaoacfa 

y y ^-j * w< y ^ y »—4 


crcr a era a cccc 


tccrcrccccracr 

c» ^y y Vrf v»» y c* y 


a a crt cert crct 

owy ^- w y i~ y ^ u- 


caacrcrtcrccrt 

*w u y y i — y y 


cr cr c cr t cr cr c cr c 

km \a " y w y y y * 


24900 


crccrcrcrctcfcfcr 


cgccgcgcac 


c cr cr cr c cr a a cr cr 

*— *y y y ^^y ^* y y 


tcr crt crca t ccr 

^— - y y w y v^y 


acracctcraacr 

t-i y V-' ^ y t-i *-* 


craeracr caaccr 


24960 


t cat ccracrccr 


cacctccaac 


cr era cr a cr cr ca cr 

y y c* y ci y y y 


t crcr tcr crt crcr a 

w y y w y y ^ y y c% 


cttccrcrccrccr 

v.- ^ y y ^» y y 


cr cr c crcr cr t a c cr 


25020 


aoaciccfcccc 

c*. y ay uy ^ 


cancatcacc 


cr cr c cr cr c rr t rr c 
y y y y ^— y «— y 


L»y uoa^L-y yy 


<»a *— y 1 — • y y o. y 


tacuy uy uy i— • 


25080 


ccgaggcctg 


gegett ccag 


caggagcacg 


crcrcracCTacrca 

yyyauyayoy 


tggecact cc 


taccagcccg 


25140 


gcccct egga 


tgacctgtac 


tcactaaaca 

«-.^.yi".v-yyy 3 


tegttctcta 


ttggctcctg 


aegggcagge 


25200 


agcccttcct 


cccggacgag 


cr c c crca a cr aa 

3 3 ^3 3 3 3 


tcrcraacrcccrt 
^yy^yy ^^y 


gctcaaccgc 


gcccccaaac 


25260 


caccccatgt 


gctcaatcca 


cgcgtccccg 


aaaccctaaa 


tcr cccrtatac 

3 3 3 3 ^ 


atgcgcctgc 


25320 


tcrcrccaacrcra 


acccaacrcracr 


cgccacccgg 


acgccgacgc 


actcrtcfccfca 


cr a cr c t cr cr a cr cr 

y a. y oy y a y y 


25380 


ccctgctggc 


ccaaacqaac 


cracftcctCTaci 

y w y >w «_ <_ y y y 


aegtgaaact 


ctgcgacgcc 


t a t cr cr c c ccrcf 

l— t-A ^ — y y v — - <w» y y 


25440 


acaccgccac 


cacgctcgcg 


gcggtgccgc 


aegegggega 


acraccracrcta 


rftrrraatrirrr 

y y v^d ci uyy V/ 


25500 


tgaagaagcg 


caaaacccqq 


cct cgt eggg 


gg cca cgt cc 


gccacacggc 


cracr crcr c crca c 
y y y y y wa w 


25560 


aegagcegga 


ctcaaacgcg 


gtggccatcc 


aggcagagct 


gccaccccca 


crtt cctcr etc 


25620 


gccccggccc 


catgccctcg 


ccgcccccgg 


cacaagccgg 


tcacttcaag 


cfccct* cr ccr tcr 


25680 


tggcgacctg 


gatgggtctg 


ctggtggtct 


tgagcgcggg 


cgcgctggtg 


atcgcgcgca 


25740 


ccttggcgcc 


ttcccctgct 


cctgccgcac 


gtcacgccct 


ggagcctcac 


cccagtgatg 


25800 


ccgtggactc 


accacctgca 


tcccccgagt 


tcgacctcgc 


gccctgggcc 


cccggccagg 


25860 


aagtggcggc 


gccatggatg 


geaceggaag 


ctcaggaagc 


cgcagccgcg 


etcgatageg 


25920 
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cgcccaccct agcggccgtc gccctccccg cgacgatctc cgaggagaag gcttccgtga 25980 

agatgaagaa gaacaccggc gtattctccg agcccgagcc gcagcgcttc cggggcggaa 26040 

ccgttggcaa ggcgctcggc ctcggtgtcg cgggctgcct agcgatggcc tgccctggcc 26100 

cccaggtccg cccgacgcct ccgccagagg cgtgcccgcc aggtgctgtc gaggccatga 26160 

gcaggcttcg aattgagtat gacgaggatg tggtagccac cttcttcgcg accgccgagg 26220 

agggggttag caggaacatc acggtgcgcg cgggcccgac aacggttcac ctggggagac 26280 

cattgggaaa tctaccggcc cggaccacgc tctccggacg cctcatcatc ggagaggatc 26340 

gcgtttacgg ccgactaact gaagcacgca cgccaaaggg tgaccgcttc cctgtatgca 264 00 

tgcagatcat tgagaacgac tctattgggt tcaagaagga gatcaacgac ggctccaaca 264 60 

gcgcctcggt ctatccctat tttgacgtga aggccgtccg ccgcttcgag tgaaccccga 26520 

ccaggagcaa catggggctg aaccaagtat caatgcaggg ttccactctg ctacttcaag 26580 

gtctccgcgc atgcccatgt cacccatcgt tgttctcctg gtgatagcgc ttttctccgg 26640 

ggcagccgcc gcgcaatcgc gtccagcagc cacgggcttg ggggtacgcc ggattgagtt 26700 

ctcggccgag aacttcgagg caatgacgac accggaggta cagataagcc cgggggtttc 26760 

caccacgttc gagttcaact ccgccctcct tcaggagaag gtcgcggttg aaggcgccga 26820 

ccgcttctcg ctcgtggaca ttggacgaag cactctccgg ctggtgccgt cggagcaagt 26880 

cctgcccggc gaacgcctgc gagtgacggt gcggttcctg gacggagcag cgccggtagg 26940 

catggcattc atcctcgtcg cgcaccccgc tcaggcggag cgcttggtcg aggtgtggcg 27000 

caaccagcgc acggtggagt cctatcaaca ggaatcgaaa gaagcccgag cggaggccca 27060 

aaggtgtcac gaagagaacg agcgcctgcg cgccgagtat gcgggaccgg gcggcctcgc 27120 

tggcctcctt gcgaaccgcg tgatcggcaa gtcgggcgtt tcggtcaagc ccctcgactt 27180 

cgagaaggaa gtccgccagc gtcctggaga cgcgatcagg attcttcgtg catggagcta 2724 0 

ccgctccgcc aaccgcgtgt ccgtgctcat ggatttggac tacccagaag ccgcgcaacg 27300 

ctggatcgcg cagggagcat cgttggtgag caagacaggt gaaacgctga gcatgctccc 27360 

tgtatggcag gaagcgcccg ttgccaatgc ccggatccgg cgcctcgtgg tggaagcaga 27420 

ggctgagccg gacgcggccc aggacacgtt caccctcaaa ctgtgggaat cgaacgggct 27480 

gcgcacgatc accctatccg gggtgacatt cccataatgt tgagaacatc ctcactatca 27540 

agctagcacc cacatagcat gaagtccgcg cccgtgcgtg tcatgggccg ttcaaacacc 27600 

tgaaaaccaa taactacatc cttacaccat ccaaaccaag ccccatgtct acagccgagc 27660 

aaacgaagaa aataatcaag aagctgcgag ccaccctaaa cgaaggaagc tcagaaaaga 27720 

ttcaagcact catccagtcc atcagactgg agaatccatc tcagtggctg aatttgatcg 27780 

ccctcgcact gggccccata gatcctagga atccaacatt cttcgaaaaa gtatccgcaa 27840 
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taaacagatt ggcagaagat cttgcgccag aagagcagca cacgctacgc ggatacgaaa 27900 
gcatcgcaag ggccatgcag atcgtatcgc aagagtccga caaatcagaa ttctggaaat 27960 
cttcgcgcct cacaatgatg tggcgcctat ttacgtacgt tgaatcccaa tacatatttg 28020 
ttagcaatac gattcgaaag aagctcaagc gggccccacc gggctcgcca tttgtcatgg 28080 
cgagcccttc aatcgcggtt ccaggaaccg atgggaccgc aacgacaagt gtcgatcttg 28140 
tttttgaatc taccgccgaa gccgccgaac taatactccg catctccatg cacaggatcg 28200 
acacccaacc cccaaatgaa aagtttaccc cccagacccc ttatgacgat cctgactttg 28260 
gcaggctcct cgcttgcgca cacgcgtggc atcgattcag cctgctatgg aaccaaatcc 28320 
agtataccga ctggcgacct gagcatgatc cacgcggctt catctggaca ccaaaaaaca 28380 
aggcgcaata cctccaaagc gaaatcggaa acattcgcga gcgcctattt ttgcagcaag 28440 
caatgctgtc tgatggactc ggtcctgggg cagaactgga gagtagaaga gaagtcctgg 28500 
aacgcctcaa gaggagcttc aatctcccga agccagggaa aacatgggac cttaagattg 28560 
atattaaagc cctgcgagat ctcctcggtc ttgatcgata tgccacggct cgcaagatct 28620 
acatgtcgcg tcgattccta accccagtaa tgggagcgcc agcagatcct gaactggaga 28680 
agtcacagaa ggagttgaag ttgaacttca cgctgcatga actcgccatg gctctgctgc 2 874 0 
aaaggaatca cgcaacattc aagaccaagt cagcaataca caacattgtt tttgcagtca 2 8800 
ccaagcagcg cctatgcaag gtaatcagcg aggcgtcggg catcgaattc gaattctgct 28860 
tttcccatat caacaacatc gtcttcgacg cacagagaaa aaacctagag atctgggacc 28920 
agccgctcgt accagtagat gaccataggg tgttgttcgt tccttcgatc attgccaatt 28980 
cccatcctat tcgcacaata gagcatttag cttcagctcg ctcaagctcc gcgtttgact 2904 0 
cccggggaga gtattatgag cagatgattc aatcgaggct tgaggaatta acacaagcag 29100 
tcgctgtgcg gggcatcaaa tttcctgctt cagatgggcg gcagatcgaa tacgacttgg 29160 
tgctttattg gcaagggcac ctatttatct tcgagcttaa gtgcctgaag atgacaacaa 29220 
ctcctgtgga tgagttcggg gcgagggagg agatcgagcg cgccgagcac caattggtac 29280 
ggcgtgtttc tattgccgaa actgactgga aggagttcag gagccaagca cagggagttt 29340 
cactccccga gactccgcca gcgcgcagaa acatccatgc cataatactg accaacatca 29400 
tgaactttac gggccgaaga actggagaga tcgtgacggc agacgactcc tgcctgttta 29460 
gatacttcga cagccctcat gttcccgtat ttgatgctgc cacgggtgcc gaaatctcaa 29520 
gcattgcacg tttgcgcctt cgcaacgagg ggccgccatc tgcttcgggg ctttggcaat 29580 
atttatcaag ccctccgcag ttggagtgga tgcggaagca aattcggcct gaataccacc 29640 
cactgccacc gctggatgag ggcaactatc actggaagct catggtcgca ttcgatccgg 29700 
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ccgacgccca gacgactaaa tagtgttggt ttggtctcgt tgcgaaggtg agcattgacc 29760 
tcggctggac gctggattca gcgtctacaa caacgatatt gcagggtgag gcggaggctg 29820 
cgtagccttc gtttagaacc cgtggagaac ccgacccatg acgctatacg aactgtttga 29880 
agccgatgat gaccgcgatg tccccgccga ctttccttgg ctcttgatcg agatccctga 29940 
ggagcaactc cgccatgcgg cgcttctcct gaatgagggt gagtggcgac cctcgcacat 30000 
ttctggcatc tggtatcggg tggatcccga gcgcccagct cagaagcagc agcggcacgt 30060 
ccacgtcgcc gcgaagaagc acatcaagat ccccacgaag caggcttcat ggaacagaga 30120 
caccacccgc catgaccgca agacattcaa tgcgaagctg ggctcgcaag ggagctatca 30180 
agacgtggcg aagacggcgc tgggcctgcc gcccgaggct gtcctcgaac acgttgtgga 3024 0 
aaatccggcc gaacaacaga tgctgctcac cgaatccact gatccccagt tgaaaactga 30300 
gttctacagg tggttggagg cccagcccag gcggtctaac ttcgaagcgc tcgtcgagga 30360 
cgcggttagg ctcaatctgc cgctcgacct ggaatcgtaa ggcagtggtt aaatgcggtg 30420 
atacgagcgg tgatacgagc gcctaacaaa actggcatag ggacccggct tctcaaggac 30480 
gcagctgcaa ccaaggcgat tttgtaggca ttctttgggg cccgagacag cgcgcgggat 30540 
ctccggcacc cagttcacgg acgtcacogt ccccggtgca gccaatggcc cctgacccct 30600 
gccgtcccca ccctcgtacc caagctgtac ccaagatgaa cggacgcggg tggacctcca 30660 
tggacgctcc acccaccccg ctccagaggg aaaccggccg accctcaatc cctccgggca 30720 
cctcgcggat cgggggagcg gggcttcaaa accggtgagg ggcgctgaga ggcgtccctg 30780 
gtgagttcga ttctcatggg gcgacgtgaa cccggccgtg ctccatgcgc tccacaacgc 3084 0 
ggcggaagat gaggtcgata atggcgaggg actgagcctt caggagtggc cggaacagac 30900 
cagggccacg tcagaagcga acgacctccc tttcgtaacc gaataacgaa acagaggcga 30960 
ggatcacaac tttcgaaatc cgttgacgcc cgtgggcttc atttcaggaa tcctgggaaa 31020 
tgcctctcct tgctcttctt gggcttctcc atggtagcgc ttcgcgcacc caggctctgg 31080 
tactatccat catatggagc gctccctcgg aacgacagag caggtcttct ataaggtacg 31140 
gcagaagcgc cccttcaaca tcgtcattac gggcaagctg cgaggacgcg tggaggagca 31200 
ggctgtgcgc caggggttgg tggccctgca acggcgacac ccgctgctcc gagcgaagat 31260 
uy^uouggt actccgccac gcttcacctc gcaaggtgtg ccccccattg cgctggaagt 31320 
ggtggaacgc aaggcagagg agtcctggca acagcatgtc gaggcagagc ttgacaggcc 31380 
tttcaactcg gaccctgggc cgctcgtccg cttcatcctc gtgcggggcg agcaccgctc 31440 
™-octgetg tgcagctatg atcatctgat cggagacgcg cactccggca tcttcgccct 31500 
gt-yu.gar.ccg ctgcaggtca tggcctccca ggaccagcac ctgcccgagc tcgccccaag 31560 
•j-.uv y .-- gaggagctga ttgggccgat ggtaccgggc acccgactgc ttggcgccgc 31620 
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cgtgcgccgg 


ggctcctcgg 


cgctgctgcg 


cttgtctccg 


accctcaata cgctcacaga 


31680 


gcggctcgtc 


gcaccgcggg gcatggctcg 


cagcagcgcc 


gagggacagg cgctctacag 


31740 


ccaccgaatt 


ctggagcccg agcagctcgc gcggctgctt gcgcgctgtc gagagcaagg 


31800 


gtcctccgtc 


cacgcagcgc 


tcggggccgc 


gctgctgata 


gctcgggccg agtcgcaggg 


31860 


tgccaagaag 


cgggtgacgc 


tcaccctcac 


ctctgcgttg 


gatgctcggg agcgctttgg 


31920 


cgtgggtgag 


gacttcggcc tgttcaccac 


ggggaagacg 


gacttcttcc gcgcgcgcgg 


31980 


cagcacccca 


ttctgggagc tggcgcacag actccgatcg 


cctctgcagg cagcacggaa 


32040 


gaagcggagc 


cacttgaggc 


tgttccgcct 


gatcctggag 


atatccgcgc tcaccgtgga 


32100 


catctcctcc 


cagccctgga 


tggaacgcgc 


gacacggctg 


ggcctgcact ccatgctcgc 


32160 


cctgagcaac 


gtggggcgtg 


tggagattgc 


cgcccgctat 


gggaacctga cgctcgaggg 


32220 


ccttggcttc 


acggggacga 


ccgcggccca 


gttcgacgtg 


gtgctgacgg cgatcacctt 


32280 


tgcccagcgg 


ttggaggcca 


gctttctctt 


caacaccgcg 


cacatgcagc gcgagcaggt 


32340 


ggagcagctc 


gcctcgcgca 


cctgggagct 


gctggccgag 


gcgacgcgct gagcggctac 


32400 


tccggctgga 


tgttgtacat 


ccgcttgacc 


tcttccagcg 


gcagctccat gaccttccag 


32460 


ggatcccagt 


ctccgaagag 


gtccacgctc 


atcgcgctgc 


ctcgctccac cgcacgccgc 


32520 


agctggggaa 


tgggaagcgc 


ctcggcgagc 


aggcgaacgg 


gcaccaagga cttgtactgg 


32580 


gccggatcat 


agttgcgggt 


gcgggggccg 


aacatgaagc 


ccaggctgaa ggccgtggtc 


32640 


agcatttcct 


cccagatgga 


gatgccatag 


ccggagagca 


ggtggacgaa gtcatgcacg 


32700 


gccgagtgca 


cgggcagggc gtgagggtga 


cccggcagcg 


gcacgccatg ggagtggcag 


32760 


tacttccaga 


actcgcggcc 


aaaggaaccc 


tcgggcaggt 


gctcgagctg ctgaaagcgt 


32820 


tgcttgaggt 


ctggcgcctc 


attgcgctcg 


cgcagcatgt 


catagagctt ccagggcagc 


32880 


tcgcccttcg 


gcagccccaa 


catgcggcag 


aagtactcaa 


tataggcagg cacgaggaag 


32940 


gcggtcagtc 


caacggacag 


ccacagcttg 


gagcgcgcgt 


ccgcggccag ggaacgctcg 


33000 


cgcggtccag 


gggctcgctg 


gtacttccgc 


agaaatacat 


ccgtgagtat tctcattcga 


33060 


acaccttggc tcgttatacg gctggggcag ctttcaggca 


ggtcgctgtc tcctggagaa 


33120 


agccatacgg atccttgggg tcctcggcca tggactccac 


ctggagcagg cggcctaccc 


33180 


ccagccgctc 


ggcgtgcgcc 


aggatgactt 


ccagcaccac 


cttgtcctcc aaggaaaacc 


33240 


cggtggagtc 


gaagacggtg 


aggccatcgc 


ggtgagagcg 


ccactgctcg ggagagcggg 


33300 


tcagttcaaa 


gaggctgggg 


ccaatctctc 


caggctcgag 


ctgctggcac tccccctcca 


33360 


ccacggcctg 


ggggagatag 


tccgggcata 


ccagggcgcg 


acgcagcatc tccaggggca 


33420 


gctctgtctt 


gccgggcaga 


tctgagccca 


cggcgttgat 


atggaggtgg gccttgatca 


33480 
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gacgcccgga caccaccggc cccgcgccca cggcgacgga ggtggccgtg cagaggatgt 33540 
ctgcctcctg ctccacgcgc gtcaggggga ccgccaccac ctccaggccc aggaaggcca 33600 
cacgccgggc aaagcttcgc atggcttcgg gagcaatgtc gtgcaccagc acccgtccca 33660 
gaggaaagac gcgcgagagc gcatgtgcct gggtgacggc ctgagcgcca caccctacca 33720 
ggcccaatac ccggctgtcc tcccgcgcga gcagccgact cgccacggcg gaggccgccc 33780 
cggtgcgtag cgcggtagga aaggtgccat cacaaacggc ccgcaggtgc ccggtcgtca 33840 
catcatacag actgagggag gaaagaatgg tgggcaagcc cagctcctcc gggctgcggg 33900 
ggttgtagcc caccagcttg atcacggcgt ggcgccccac ctgcatgacg ggcatccact 33960 
ccaacacgcc ggggtggggg tgctgatact ggaagccctc gcgcttgcgc agctcggtgt 34020 
tggcggggtc gaagtgctcc agctcgtact cgagggtgga gatgacctca tccatcagcg 34080 
cgtgcagccc cgttgcacgg acgatctggc gcagttcctg ctcacccagg atccttgtca 34140 
tccgcatggc tgaccccgcc cctttcagct gcatgacagg cctcactgca ttcacggctc 34200 
cagttcaggg ccgggcaaaa tctatcttca acatcccctt gtccaggacc gcgtccacgg 34260 
catccagctg cgccaccgag cgcggcagca tgacgtggcg gcggaagccg cccacgcgga 34320 
tgatgaggtc attgccctgg cgatccagga caatgtctcc gcggtcggca cctggcatga 34380 
ggagctgcag ccgatagcgg ccctcgtgct gcttggcgaa ctgatagggc tgcccggaga 34440 
tgtagcgctc ggtggggtcc gagtccccgt agagccgctg cgccagatcc tccaaccgct 34500 
ccagccccac cacctcgtgc tccagcagcg ggagcctggc caccggcatg ggggagaagt 34560 
actcctggat gtgatccagg taggcagatt gagacttgtg ccactgagca aagtattccg 34620 
tgtccggcag cagacgattg accacgacct ggtccaccgt catgccatac atgttgaagt 34680 
agaggtaggc ccgctgtgtc tcgcggatga ccattttttc cgcgctggag accaatcgca 34740 
ccgtggtgcg attgccatcc gtgagcaacc cttcaatgcc cttgacgcga tcgaagagct 34800 
gctcgaggga ggcaaagtag ctgtcctcog gcagatcata gctcgtgagc cgactggcca 34 8 60 
ggggccgggc cagcttgacg agcgtgcgat ccacattgaa gcgccggcgg atataccact 34920 
gcagcacgct ggtgatgttg acgaagcgca gcgactcgcc cgtgggcgga cagtccacga 34980 
tgatcacgtc atagcgcccc tgctgcacgt actggttcag gtagatgagc gagatgacgt 3504 0 
cctccgtgcc cgggagaatg gcgacctcct cggccaccat gttggatacc ccggtggagg 35100 
tcagcagcac ggacatgtag ttatagactt cactccactg ccgctggagt tcgtgctgga 35160 
tgtcgatctc ttgcagttca aggttggggg cgaccttctg gggcagtccc tcgttgaagt 35220 
cgaagagctt gcgatcgaga tcgaagctgt ccgagaggct gtgggcaatg tcgaaggaca 35280 
rcaccaaggc gcggtagccg cgcttggcag cggccaccgc ggtggccgcg gatgccgtcg 35340 
LuLLyw-c-L-s. ^ccacccttg ccagagaaaa gaagaattct tgtcatggat gccccgcgtg 35400 
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cctgccggca 


acgggccgcc 


atggcgctct 


ccccctccat 


gggtcacctg 


ttgcgtctgt 


35460 


ggaactcatc 


actatcagcg 


acatgccaga 


aggagcaagc 


tggacggtgg 


cttgacttgg 


35520 


tatgacgctg 


gacagtaggt 


atgccaatag 


aggtgggggg 


ctttatgaaa 


tcgatggatt 


35580 


ggctatcggg 


cacagccccc 


ccccttgttc 


aattccagga 


ggggtgcaat 


tgacagccac 


35640 


agcgactgcc 


gagcacgcgc 


tgaaagccct 


ccgggccaag 


ggactcgcca 


cgtcggacga 


35700 


ggcgctgccc 


gcggtgcggc 


acgccagcgg 


cgaaaaggct 


cccctctcat 


cctcacagcg 


35760 


gcgcatgtgg 


ttcctggagc 


agctcgagcc 


aggcaacgcg 


gcccagcatt 


tgttgcaaag 


35820 


tcatcacata 


caggggcctt 


tgcaggtaga 


ggggttgcgg 


cgggcgttgg 


ggctcatggt 


35880 


caagcgccat 


gaggcgctcc 


ggctggtggt 


tgccattggc 


gcggcgggcc 


ctgagcagcg 


35940 


tgtgcggccc 


gagtggtggc 


ccgagctgcc 


tctcagcgat 


ctgagcgggg 


tcgcccaggc 


36000 


ggacagggct 


cgggccctgg 


cggagctggc 


gcgcgtggag 


gctgcggcgc 


ccttcaacct 


36060 


gcagcaaggc 


ccactgttcc 


gagtccggct 


ggcgcgactg 


gcggaaacgg 


agcatgtgct 


36120 


gctggtcacc 


ctgcatcacc 


tgatctcgga 


tggagcctgg 


agctgcgagg 


tgctcatcaa 


36180 


agagctggcg 


atgctctacg 


gccagcacgt 


ggaaggctcg 


agcccggagc 


tggcgccact 


36240 


gcccgtccaa 


tacagggact 


acgctggctg 


ggaggcaagc 


ttcgcgcccg 


cgggagaggc 


36300 


cttggaggcc 


tggtggcggc 


agcggctcgc 


cqqggt.qc.cc 


acggtgctgg 


agctgcccac 


36360 


ggagggagcg 


cgtcccctca 


gacagacgta 


ccgggcaggg 


cgggtggcca 


tcaccgtgca 


36420 


gccgcgcctg 


cgccaggcgc 


tggaggacct 


ggcgcacaag 


gagggcgtgt 


cactctttgc 


36480 


cctgctgctc 


accgccttca 


ccacgctgct 


acaccgctac 


tctcgccagg 


agqaattqqt 


36540 


ggtggggtgg 


cccgccccgc 


agcgcccccg 


gcccgagctg 


catggcctca 


ttggatactt 


36600 


cggcaccccg 


gtggcattgc 


gctcgcggct 


ggagcctcac 


acccgggtgc 


gggaggcgct 


36660 


gcggcagctc 


gatcaggagg 


tgcgtgaggc 


caacgcccat 


gcggcgctcc 


ccttcgagcg 


36720 


gctggtgagc 


ctgctagaca 


ttgcccgcag 


ccccagccgc 


cacccgctgt 


tccaggtgct 


36780 


gttcgacctg 


ctgccggagc 


agccccagcc 


taccgctggc 


ggctgcgcct 


tccggccctg 


36840 


ggagtccttc 


accggcctgg 


tggcgtatga 


tctcactctg 


ctgctggagc 


cgcgcggtga 


36900 


ggggctggag 


ggggcgctgg 


actacagcgc 


ggacctgttc 


agcgaggcgc 


gcatgcagcg 


36960 


cgccgccact 


cagtacctgc 


acctgctgga 


gcagctcgtg 


gagcggcctc 


acgagaggct 

3 3 3 3 


37020 






cgcctggtga 


gcacgaggct 


ttgctggcgg 


ggcacggcct 


37080 


gggcgggccc 


gtggagggcg 


cgcaggtgct 


ggcccaccac 


cgcttcgagc 


accaggtgcg 


37140 


gctcaccccc 


catcaccccg 


cgttgtgctt 


cggcccgcag 


gtgctctcct 


acgagcagct 


37200 


caaccgccgc 


gccaacccgc 


tcgcccaccg 


gctgcggcgc 


ctgggcgcgg 


gcccggacac 


37260 
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cctggtaggg ctgtgcgtgg agcgctccct ggagctgccg gtggcactgc tggccatatg 37320 

gaaggccggg gccggcttcc ttccgctgga cgtcaatcag ccccgcgagc ggctcgcctt 37380 

cctcctggga gatgcgagct gccgcatcct cctcacccag gagcacctgc tccagcgcct 37440 

gccccccacg aacgccgcgc tcctgtgtct ggagagggag gcggaggcac tggagcgcga 37500 

gccccaggag gatgcgccac acgaggccgg gctcgacaac ctggcctacg tcatccacac 37560 

ctctggctcc acgggcacgc ccaagggcat cgcgatggtg catcgctgtc tggccaacct 37620 

ggtggcctgg cagctcaccc atgagcggct gggtggcccc tcacgcacgc tccagttcgc 37 680 

ctcgctcaac ttcgacatct gctaccagga actcttcacc acctgggccg ctgggggcac 37740 

cgtggtcatg gtgacggaag aggtgcgcag ggatcctgcc cggctgctgg aagtgctgga 37800 

gcaggagcag gtgagccggc tgtacctgcc cttcatcgcc ctgcagcagc tcgcgcgggt 37860 

ggcggatgag cgcggcgcgg cccctcgcca cctgcgccag ctcattaccg ccggggagca 37920 

actccaggcg acccccgagt tgcagcggct gctctcgcgg atgccggagt gcaccctcca 37 98 0 

caaccagtac ggcccctccg agtgccacgt ggtgacgagc catgacctga cacgcgagcc 38040 

gagccgttgg ccacggctgc cgcccgtagg ccggccactg gcccacctcc gggtgctgtt 38100 

gctggacggg gagcagcagc tcgtgcctcc gggtgtggcc ggagaggtgt tcctgggcgg 38160 

ccccgccctg gcgcgcggct acctgggacg tccggagcag acggctgatc gcttcgtgcc 38220 

cgatccattc tcccgcgagc ctggagcccg gctctaccgc accggggacc tggcgcgcct 38280 

ccgagaggac ggagcgctgg agttcctgca acggatggac gcgcaggtga agatccgggg 38340 

ctaccgcatc gagcctgggg agatcgaggt ggtgctgtgc gagcaccccg ccgtgcacca 384 00 

agcgcacgtg cggccctacg tggacagcgc cggagagcgg cgcctggtgg cctacgtggc 38460 

cgcacggctc gaggacacgg acggggcgga gacagagcac gtggagcgct ggcgcgcggt 38520 

gtgggatgag acctatggcg gcccaagcgg cacggccttt gacctggccg ggtggaatga 38580 

cagcgtgcgc ggcgagccgc tgcctccgga gcagatgcga gagtgggtgg agacgacagt 3864 0 

ggagcggctc atggagctcg ttccccggcg ggtgctggag ctgggctgtg gctccgggct 387 00 

cctgctccgc cggctcgccc cccgctgcga gtcctactgg ggcacggagc tctccccggt 38760 

agccgtcgag cggctgaggg agcaactcca gacgggcggc tccccgctcg cgcagcgcgt 38820 

gcggctcatg gcccagccgg cggatgactt ctccgggctt ccggaagccg gcttcgacac 38880 

cgtcatcctc aactcggtga cgcagctctt ccccagcgtg gactacctgo tgcgggtcgt 38940 

ggagggcgcg ctgcgcgtgc tccaaccggg cggaaccctg ttcattggcg acgtgcagaa 39000 

tctgcggctc ttcgagctgt tccatgcctc ggtggccctg gagcaggcgt cagcggacct 39060 

ggaggcgcca gcgctgctcg cccgcacccg ccagcgcatg ctgctggacg aacggttgta 39120 

cgtggacccg gacttcttcg ccgcgctcgc cacgcacttt ccccagctgg gcgcggtccg 39180 
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gctgcacctc aagcgcggga gcgggaggaa cgagatgaac cgcttccgct acgacgtgga 3924 0 
gctgcagctc gcccccgtag cgaaagcagg ccccgcgcaa gagctgccag agctggactg 39300 
gaggcatgag gggctcggcc tggaaaggct ggagcgcatg ctggcggagc gaccggcggg 39360 
gctggtgctg cgcaatgtgg ccaacgcccg cacggcggac gaggcggcac ggctggcgct 39420 
gctgcggact ggcaacagcg tgggccggct gcgggcactg ccggcggtga catcctggaa 39480 
tccagagcag ctctggaggc tggcggaggc agccgactac acgtgccacg tcacctggag 3954 0 
cgcccaggac gaggaaggcc gtttcgacgc gctgctcatg gcccgcgcgg ctggctcctc 39600 
gcgcccggcg gcctggctga cgcctccccc gcctccccct cgaccctgga agagctacgc 39660 
caaccagccc ctggcggcca gccgccggcg caccctcgtg ggagtgctgc gaagccacct 3972 0 
cgagcacaag ctgcccgagt acatggtgcc ctcttccttc gtgctgctgg acgcgctgcc 39780 
gctcaagccc accggcaagt tggaggtggc ggcactgcct ccccccgagc ccgcacaagc 3984 0 
cgagcaaacg ccaggccacc tcgcgccgcg gacccccacc gagcgccggc ttgccgagct 39900 
gtggcggcgc gtgctcggcg tgcatcgcgt gggggtggag gacaacttct tccagctggg 39960 
gggccactcg cttctggcca cacggctctt gtcgctcatc cgcggagagc tggggctgga 4 0020 
gctgcccctg cgggtgctct tcgagcagcc caccctggcg gccatggcgg gttgtctcga 40080 
ggctcagagc tggagcaccc aggctcccca ccctccctct gcctccatcg aggagggaga 4 014 0 
actttgagcg ccggagcgtt gctcgcccac gccgcgagcc tgggtgtacg cctgtgggta 40200 
gagggcgagc gactgcgctt ccaggctccc ccaggggtga tgacgcccga gctgcagtcc 40260 
cgcctgggag gcgctcgcca tgagctcatc gcactgctac ggcagctcca gccctcctct 40320 
cagggaggca gcctcctggc cccagtggct cgaaacgggc ggctggcgct ctccttcgcc 4038 0 
cagcagcggc tgtggttcca ggagcagctc caccctgagg ccccggccaa caacctcacg 4044 0 
ggggccgtgg tgttcaccgg cccgctgcac gtggccgcgc tgctcggggc ggtggctgcg 40500 
ctggtgaggc gacacgaagc gctgcgcacc accctgggcg aggagggcgg agtcccctat 40560 
tcgctcatcg gcgagccgtg gcagccggcg ctggaggtgg aagcgctacc gggtgccacc 40620 
gtgggcgagc ggctggagca ggcccgagag gtggccctgg ccgagtcccg gcgccgcttc 4 068 0 
gccctggaga cggagcccca cctgcgcgtg cgcctgctgc gcctggccga gcagcagcac 40740 
gtgctggtgc tcagcctcca ccacatcgcg gcggatggag tcgggctgca ggtgctcgaa 40800 
caggagctgg cggcgctcta cggcgcgctg agcgctgggg ccgagccgcg gctacctccc 40860 
ctgcccctgc aggtggcgga ccttgccgac tggcagcgcc ggtgggtgga gggagaggag 4 0920 
taccaggtcc agctcgctta ctggcggcga cagctcgccg ggctcacgcc cctggaggta 40980 
ccaggagatc atcctcgccc gcgtatcccc tccatgagag gcgcggaggt acgg gcgccc 4104 0 
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ctgctctccg ctccgcaagc ccaggtgctg cgcgcgctgg ggcaaggcga gggcgccacc 41100 
ctctacatga cgctcctggc cgccctgggc gtgctgctgc aacggtggac gggccagcac 41160 
gatatggcgg tgggcagcgc cgccgccaac cgcaaccgcc cgggcttgga gggcatcctc 41220 
ggcttcctcc tcaacatcgt cctgctgcgg ctcgacttga gaggccgccc ccgcttccgc 41280 
gagttgctgc gtcaggcacg cagggtgtgt gtggaggcct atgcccatca ggagttgccc 41340 
ttcgagcacc tggtggaggc gctccagcca ggcagcgagc gcggggacag ctcgctctat 41400 
cgcgtggcgc tcgcggtgag cgacacgcca tggatgcccg gccacgggct gaagctggag 414 60 

ggggtgcaag cgcagccgct cgactttccc cggggggtgc tggacctgga tctccacctg 41520 

tgggtgtatg acaccgggga ggggctcacc gggcggttgg agtacgccgt ggatctgtat 41580 

gaggagccca cggcaaggcg gctcctggag ggcttccgcc aggtgctgga ggcggtggtg 41640 

gaggcgccgg atcggcccgt gccggagctg ccggtgctgg gagagcagga gcgccaccag 41700 

gtgctctccg ggtggaaccg cacacagcgc ccctatccgc gagaggccag cgttcacggc 417 60 

ctctttcagc agcgtgccct ccaggccccg cgggccgtgg cagtggtgta tggcgagcgc 41820 

agcctcacct acggcgagtt ggcggagcgg gcccgcgggc tcgcacaggg tctggtggcc 41880 

cgaggcgtgc ggcgaggaga cctggtcgcc cttcggctgg agcgctcgcc ggagcaggtg 41940 

gagtccatgc tggccgtact ccaagccggg gccgcctacg tgccgctgga tccttcctac 42000 

cccgtacagc gccaggagtt catgctccag gacagcggcg cgcggttgct cgtgcacagc 42060 

ggcccgctgc ccttcgctcc ccagggctgc gccacgctgg atctccaggc atggcacccc 42120 

gctccctccg acggtggcga gcccctgocc cagtgcagcg gagaagatct ggcgtatgtc 42180 

atctatacgt ccggctccac cggccagccc aagggcgtgg ccgtgtgcca ccgggccatg 42240 

acccgcctgg tgtgcaacac ggactacgtt cagctcgggc cagaggatcg ggtcgcccag 42300 

gcttccaatg cctcctttga cgcggccacc ttcgaggtct ggggcgcgct gctcaatggg 42360 

gcacggctgg tggggctcgc cacggaggag gccattcagg cgcgccggct ggccgaggtc 42420 

ctgcgcgagc agcggatctc cgtgctcttc gtcaccaccg ccctcttcaa ccacgtggcc 42480 

cgggagcaac cccaggcctt cagcaccctg cgctacctgc tctttggcgg cgaggcagtg 42540 

gatgcctcca gtgtgaggcg ggtgctgaaa cagggcgctc caggacacct tctacacgtg 42600 

rar "" rC££a "rgagaacac caccttctcc accgcctggc gggtggaaca cctggccgag 42660 

caggcgcata ccgtgcccat gggtcatccc atcgccaact cgcggctgca cgtgctggac 42720 

gaagcgctgc agccggtgcc ggtgggcgcc atgggcgagg tatacctggg aggtgatggg 42780 

— rgrgctgg gctactggcg ccaccccgag gccaccgcgg agcgcttcgt gccggatccg 42840 

cacggccrgg agcccggagg ccggctctac cgcacgggag atctggcgcg gcggcaggcg 42900 

Sacggggccg tggtgttcgc tggccgcgtg gacaggcagg tcaagctgcg aggcttccgc 42960 
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gtggagccgg 


cggagatcga 


gtcgcacctg 


tgcgagcact 


cggaggtgtc 


cgcggcggtg 


43020 


gtggagctgc 


ggggcgaagg 


ggccctgcgg 


cggctggtcg 


cctatgtcgt 


cccccgcgcc 


43080 


ggtggccgcc 


caggtgccga 


ggagctgcgc 


accttcctcc 


gcacccgact 


gcccgagtac 


43140 


atgctgcccg 


ccagcttctc 


cctgctggag 


gcactgcccc 


tcacccccaa 


cggcaaggtg 


43200 


gatcgctcgg 


cgctacccga 


gagcttcgag 


gaggcctccc 


gcgagcaggc 


tccggtggta 


43260 


cctccacgag 


gccccgtgga 


ggcgctgctg 


gtggacatct 


ggagggaggt 


gctcggcacc 


43320 


caacgggtat 


ccgtgcacga 


cgacttcttc 


gacctgggcg 


gacactccct 


gctggccacg 


43380 


cgggttgtct 


cccgcttgcg 


agaggcgctg 


caggtggagt 


tgccgctgcg 


caccctcttc 


43440 


gaggcccctc 


agctctcggc 


gctggcggcc 


caggtggagg 


tcctgctcgg 


ccaccggcag 


43500 


ctccgccccc 


ctccgctcgt 


gcccgctgta 


cgtccaccag 


agctgcccct 


ctccttcgcc 


43560 


cagcagcggc 


tgtggttcct 


ccagcagctc 


gcgccacaaa 


gcaccgccta 


ccagatcctg 


43620 


gatgcgtggc 


atgtgcgggg 


ccgggtggac 


gtgggcgccc 


tggagcgcgc 


cctggagcag 


43680 


ctcgtgcgcc 


ggcatgaggc 


gctgcgcacc 


actttcgagc 


ctggcggcga 


cggcgtgcct 


43740 


cggcagcgca 


tccacgcgcc 


ggctccggtc 


ccgctgcggc 


aggtggacct 


gcgctcccat 


43800 


ggcatcgccg 


cgcgggagga 


ggccttgcgg 


tggatgcgcg 


agcaggcgct 


ccggccactg 


43860 


gagctggaca 


aggggccgct 


gctgcgcgtg 


tccctgctgc 


gcttggaaga 


ggcacagtcc 


43920 


ctgctcttcc 


tcgagctgca 


ccatatcgtg 


ggcgatggct 


ggtcgctgag 


cgtttggagc 


43980 


cgagagctgt 


cgcacctgta 


cgaggcggcg 


ctccatggcg 


ctgagccggg 


gttggcgccg 


44040 


ctgcccgtgc 


agtacgcgga 


cttcgcgctg 


tggcagcgcg 


gctggttgca 


agggccggtg 


44100 


ctgcgggagg 


agctgacctg 


gtggcgggag 


cggctcgcgc 


gcctggcccc 


gctgcgcctg 


44160 


cccgcggacc 


atgcccgccc 


ggaagtgcag 


cgcttcaacg 


gcgcgacgta 


ccgcttcacg 


44220 


ctcccgggcg 


tacgggtgca 


ggcgctgcgg 


cggctcgggc 


atgagcacgg 


ggccacactc 


44280 


ttcatggtgt 


tgctcgccgg 


cttcaatgcg 


ctgctggcgc 


gctacaccgg 


gcagacggac 


44340 


atcgccattg 


gcgcgcccat 


cgccaaccgc 


acacgtgggg 


aggtggaggg 


cctcatcggc 


44400 


ttcttcgtca 


acaccctggt 


gctgcgcacc 


cgcctggaag 


gcaacccctc 


cttcctcgag 


44460 


ctgttgaggc 


gggtgaggga 


gacaacgcta 


gaggcctacg 


cccaccagga 


gctgcctttc 


44520 


gagaggctcg 


tcgaggagtt 


gcagcccgag 


cgccaggcca 


accagaaccc 


gctggtgcag 


44580 


gtgctgctcg 


ccctgcagaa 


cgcgccccgc 
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gagcacctgg 


agtacctggt 


ggccaccacg 


cgcttcgacc 


tcgagctgca 


cctgtgggag 


44700 


gaggaagagg cjgctgagctg 


catcgccgtg 


tatgaccgcg 


cgctgtacgg 


cgcggggacg 


44760 


gtggagcggc 


tggtgggcgc 


ctggtgcacg 


ctgctggagg 


gcgtggccga 


gctcccggcg 


44820 
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tccgccgccc 44880 


tcccccgccg 


eggegatega 


aacaagcatc 


ggcgcccgct 


teteggaggt 


ggegeggegg 


44940 


cagccaggcg 


ccacggccgt 


cacccaaggc 


gggcggcacc 


tcacctatgc 


agagctggag 


45000 


gagcgctcgg 


aacgcctggc 


ccgctacctg 


gcctggctgg 


gcgtgcgcgc 


gggegacaga 


45060 


gtggggtrgg 


ccaccgagcg 


caegctggag 


eggatcatea 


gectgetagg 


catcctcaag 


45120 


gcgggagccg 


cctacgtacc 


gctggatgtg 


cgccagccag 


cgcgtcggct 


gagectgetg 


45180 


gtgcaggccg 


cgggcgtgcg 


caccgtcatc 


gecgaggage 


aggctcgcac 


ggtgctgtca 


45240 


ggcctggggc 


agccactgac 


cctggtcgat 


gccgcccagg 


agccggcctc 


cgcgcagcag 


45300 


gtccccgcgc 


tegggcegga 


gcgctcgctg 


ggeggggaca 


tgctcgccta 


cgtgcttttc 


45360 


acctcgggct 


ccaccggcga 


gcccaagggc 


gtctgeatte 


cccaccgtgc 


cgtgctgcgg 


45420 


ctgatccatg agccctccta 


cgtccagctc 


tcgccgcggg 


aggtcatget 


gcactacgca 


45480 


ccgctggagt 


tcgatgcctc 


caccttcgag 


gtctgggggg 


cgctgctcaa 


eggggcaagg 


45540 


ctcgtgctgg tccctccgga 


acagcagtcg 


ctcgagagcc 


tgggacagga 


gctgagcacc 


45600 


cagggegtea 


ccgtgctgtg 


gctcaccgcc 


gggctcttcc 


ggctcatggt 


ggaggagcag 


45660 


ctcaagagcc 


tgcgcggcgt 


gcgccagctg 


ctcgccggag 


gggacgtgct 


gcccatgccg 


45720 


caggta cgee 


ggctgegega 


ggccctcccc 


gagtgecage 


teatcaaegg 


ctacggcccc 


45780 


accgagagct 


gcaccttcac 


ctgctgtcac 


cgcgtgggca 


gcccccagga 


getgggegge 


45840 


tccgtgccca 


tcggcacccc 


catcgacctg 


ggctgggtga 


gcgtggtgga 


cgagcggctc 


45900 


cagcccgtgc 


eggaeggage 


cccgggcgag 


ctgctcgtgg 


gcggccccgg 


gctggcttgg 


45960 


gggtacctgc 


aacacccgga 


gctcaccgcc 


gagegcttea 


tccccgatcc 


gctcagccgg 


46020 


aegceggggg 


ctcgcgtcta 


teggaeggga 


gacctggtgc 


ggcgcaggga 


ggatgggacg 


46080 


ctggagttcc 


tgggccgcgt 


ggaccaccag 


ctcaaggtac 


gcggcttccg 


catcgagccg 


46140 


ggcgaggtgg 


aggecgeggt 


gctcacccac 


ccggcggtgc 


agtccgcggt. 


ggtggtgggg 


46200 


cgcgagggcc 


ccggcggcaa 


ggagctggtc 


tgctacgccg 


tgccgcgggt 


ggagagctcg 


46260 


gagcagggct 


cgcagcagga 


geageggctg 


gtacacgagt 


gggagtccgt 


gttcgacggg 


46320 


cacatgtacc 


gcgaagcccc 


cgtgggcgga 


gagccgacct 


teaacategt 


cggctggaag 


46380 


agcagctaca 


ccggccaacc 


cgtggccgtg 


gaggagatga 


gggactggct 


gcgccaccgg 


46440 


gtggagcgcg 


tgegeggget 


caggccacga 


cggatcctcg aagtgggctg tggtacgggg 


46500 


ctgatgetet 


ttgccctgct 


gccgcactgc 


gagegctacg 


tgggcacgga 


tttctccccg 


46560 


ccggcgttgg 


actacgtgcg 


tcgctacctc 


cccccggagc 


acccgggccg 


cgtggagctg 


46620 


ctacaccgca 


eggeggaega 
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gtggcggccg 


gctccttcga 


cgcggtgctg 


46680 




^ggtgcagta 


cttcccctcg 


caggagtacc 


tgcggcaggt 


gctggcccgc 


46740 
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tgcgtggagg 


ccgtggagga 


cgggggcttc 


gtcttcgtgg 


gggatgtgcg gagcttgccg 


46800 


ctgctggagt 


ccttccacgc 


ctcggtggag 


ctggagcggg 


ccgccccctc 


catgccgctg 


46860 


gaggcgtggc 


gcgagcgggt 


gcggcgtgcg gtgctggagg acaacgagct ggtggtggac 


46920 


ccggcactct 


tcgtggcgct 


ggcccatcag 


catccacggg 


tgagccacgt 


ggatatcgag 


46980 


ctgacgcgcg 


gcacgcaccc 


gaatgagatg 


gcacgcttcc 


gctacaacgc 


cgtgctccac 


47040 


atcgggccgc 


gcacgccgcc 


accggcctcc 


gaggtgccgt 


gggtggactg 


gtccacgcaa 


47100 


ggactgagcc 


tcgatgcact 


gcgggctcgg 


ctccggcagg 


gcccgccagg 


gccactgggg 


47160 


gtggcaggca 


ttcccaatgc 


gcgagtgctg 


cccgcggtcc 


gggccgccga 


ggcgctgggc 


47220 


agcacaggca 


gcgcgcggcg 


ggtggaggag 


ctgcgccggc 


gcctgtccca 


acccggcaca 


47280 


ggggcccagg 


atccagaccc 


cttctggcaa 


ctggccgagt 


cgctcggcta 


cacggcggca 


47340 


gtgagctggt 


ctcctgggcg 


gagggacggg gccttcgacg 


tgctcttcct 


cccggcgacg 


47400 


ccagggatgc 


acccgcggtg 


gctcggcccc 


acgccgctca 


accgcacgcc 


gcccccagcc 


47460 


tccgcgcgtc tgtcctcgga gcctcggcgg gccagcctct 


cactgaggct 


gggctccgcg 


47520 


ctgcgggccc 


acctgcagac 


ccatctacca 


gacttcatgg tgccctcacg cttcgtggtg 


47580 


ctgcagtccc 


tgccgctcac 


tcccaatggc 


aaggtggacc 


gcgcggcctt 


gcccgtcccc 


47640 


gactcgcgtc 


ggctggaatc 


tgccccgctg 


gtgcccccca 


gcaacgagct 


ggagcgggtg 


47700 


ctggcgcagg 


tgtggaagga 


agtgctggga 


ctggaggaag 


tgagccggga 


ggacaacttc 


47760 


ttcgacgtgg 


gagggcactc 


gctgctgctg gcgcaggtgt 


gcagccggct 


ggaggcccga 


47820 


ctgggccggc 


ggctcgaact 


ggtgaccctc 


ttccgctact 


cctccattgc 


tgccctggcc 


47880 


gagcacctgc 


aggcccccca 


ggagctggca 


gccgccgagg 


cgcaggtgca 


gcgcatggcc 


47940 


acggagcgcg 


ccttgctcca 


gcaacaggca 


gcccaacgcc 


gccgggccgg 


ttcgaagcgg 


48000 


ggagctccca 


atgacaccta 


gtggggatga 


agcgctgcag 


tcttccatcg 


ccctggtagg 


48060 


catggccggc 


cgcttcccgg 


gggccccgga 


cgtggagtcc 


ttctggcgca 


atctggtggc 


48120 


cggagtggag 


tccatttcct 


tcttctccga 


ggaagagctg 


agacaggcgg 


gcgtgtccga 


48180 


gcagatacgt 


cggcgccccg 


agtacgtgcc 


cgccaagggt 


gtgctggaag 


acctggagct 


48240 


gttcgacgcg 


ggcttcttcg 


gctactcgcc 


gcgcgaggcg 


agccacctgg 


atccgcagca 


48300 


acgcctgctg 


ctggagtgct 


cgtgggaggc 


gctggaggat 


gcgggccttc 


gcccggatca 


48360 


gctcccaggc 


tgggtgggcg 


tctacgtggg 


cgccggggac 


accagctacc 


gcttccagct 
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gctgcgcgga 


cacggggacc 


cgctcagcgg ctccaaggat gtggcgggct tctttggcaa 


48480 


ctacccggac 


tttctcgcca 


cgcgcgtggc ctacaagctc aatctgcgcg 


gccccgcgct 


48540 


cggcatccac 


accgcgtgct 


ccacctcgct 


cgtctccatc 


aacatggcgt 


gcagcgcgct 


48600 
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gcgcggcttc gagtgtgaca tggcgctcgc gggcggcgtc tcgctgcggc tgcccgcgcg 48660 
ctccggctac ctctacgagg aggggggagt cgcctcgaag gacgggcact gccgcccctt 4 8720 
cgacgcgcgt gctaccggca ccgtcacggg ggatggcgtg ggcgtggtgg tgctcaagcg 48780 
cctggaggac gcgctgaagg ccagggatcc catccacgcg gtgatccgcg gctgggcgct 48840 
caaoaacgac ggcgcctccc gggctggctt caccgcgccc agcgtggagg gccaatccga 48900 
ggtgatcgcc ctggcgcacg ccgcggcggg catcagcgca cgcgacatca cctacgtgga 48960 
ggctcacggc accggcactc ccctgggaga cccgatcgaa gtggcggcgc tcacccgcgc 49020 
cttccgggcg cataccgctg acaccgcctt ctgtaccctc ggtgcggtca aatccaacat 49080 
cggacacctg gatgcggcgg caggagtggc cggggtgatc aagacggtgc aggccctgcg 4914 0 
ccaccggctc atccctccca cgctccactt cgagcggccc aatcccgcgc tccacctgga 49200 
gcagagcccc ttcttcgtca acacccagcc gctcccgtgg gagagcccgc gcgggcctcg 49260 
gctggcgggg gtgagctcct ttggcattgg cggcaccaac gcccacacgc tcttcgaaga 49320 
ggctccgccg ccgcccgcca gcggccccac ccgtcccaac caggtgctgc tgctctcggc 49380 
ccgaagcacc agcgcgctcg agcacatcgc aggacggctc gcggcacacc tgcgccgcca 49440 
cccggacctg gaactggcgg acgtggcctt taccctccag gtggggcgag cccgcttccc 49500 
ctaccggcgc gccctcacct gtcgcaccct ggccgaggcc atggagcgcc tggaggctcc 49560 
cgagccgcgg ccacccgagc cgcttgctca cgagggagag cggccccctc tcgtcatgct 49620 
ctttccaggg cagggcactc cgctggtcgg cacggcccgt gcgctccatg aatccgagcc 49680 
caccttccgc caggcggtgg agcagtgtgc ccggctcctg aggcagacgc tggggctgga 49740 
tgtcagggag gtgctcttcc cctccgcgga gcaggaggag caggcacgcc ggctggcggc 49800 
gcagacgcgg gtggcccagc ccgcgctctt caccctggag tatgccctgg cgcagacctg 49860 
gctgggctgg gggctccaac cgcaggcgct ggcaggccat agcctgggcg agctggtggc 49920 
ggcgtgcctc gcgggtgtct tctccctgga agatgcactt cagctcgtgg cagcccgggg 49980 
acagctcatg cagggatgcc caccgggggc catgctggcc gtgccgctgc cggaggccga 50040 
gctggcggcg ctgctgggca gcgagctgtg tatcgccgcg gtgaacgggc cacgggcctg 50100 
cgtggcctcg ggtcctctac cggcggtaga ggcgctcacc gccgcgctgg agagccgggg 50160 
cgtgtccagc cggcgcctgg agacttccca tgccttccac tctgcctcca tggaggcatg 50220 
ccaggggcca ctcaccacgc tgctgcggcg catgcgcctg caggcccctc gcctcccctg 50280 
tgtctccggc ctcaccggcc ggtggctcac cggcgaggag gccacggagc ctacgtactg 50340 
ggcccgccag ctccgcgagc cggtacgttt ctccgaggcc ctcgagacgc tctggagcct 504 00 
caaggagcca gtgctgctgg aggtaggccc cggcaccacg ctcaccgcgc ttgcgcggcg 50460 
gcaccccacc cgccccgccc gcacccagga ggtggcgagc cttoccgtgc aaccggacac 50520 
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tgagctgtgg 


caggccggcc 


tggagctgga 


50580 


ctggagcgcc 


ctgcacgccg 


cgccacgcca 


ccgcgcacac 


cttccaccct 


accccttcga 


50640 


gcgccagcgc 


tactggattg 


aaccggaggc 


tgccccgcag 


ccgcgcgcgc 


agcagccgac 


50700 


gcccgcgtcc 


cttgtgcccc 


cggagcagcc 


ctcgcgcgag 


gccctggaag 


actggttcta 


50760 


cgtgcccacc 


tgggagcagg 


cacctgccac 


gagcgggggg 


ggacagcccc 


tggcgggccc 


50820 


ggtgctcgcc 


ttcatggact 


cctcgggcct 


ggccgagcag 


gtgctggccg 


cgctgtggcc 


50880 


cgcggactcc 


ggcgcgctcc 


tcacccgcgt 


cgagccggcc 


gggcactatg 


agcaactcag 


50940 


tgagcacgcc 


ttccgcctcc 


gccccgagag 


cgaggaggac 


tgggacgcgc 


tcttccaggc 


51000 


gctccagtcc 


cagggccggc 


tcccccgtcg 


catcctccat 


gcctgggctc 


tcaccgcgga 


51060 


gcccggcccc 


tgtaccccgg 


acggggaagc 


ggtgctggaa 


cagggcttct 


tcagcctgct 


51120 


gcggctggcc 


cgtgcgctcg 


gccgccacgc 


gcctgagcgc 


cctgtacagc 


tcgaggtgct 


51180 


gtccagcttc 


gtccacgcgg 


tgggcccgcg 


ggagccgctc 


gagccgctca 


aggccacgct 


51240 


gctgggggcc 


tgtgcggtac 


ttcccctgga 


gtaccctcac 


gtgcagtgcc 


gcaccattga 


51300 


cgtgcggccc 


gggagcgagc 


cacgggaagt 


gctggtgagg 


agcctggccg 


ccgagctggc 


51360 


cgctcccatg 


ggagagagcc 


cggtggcctg 


gcgcgacggg 


cagcggtacg 


tgcgccgcgc 


51420 


cacccggcag 


aggctcgagg 


cctcgcggcc 


cctgcgcagc 


cttcgggagc 


gaggcgtgta 


51480 


cctggtagca 


ggagggctgg 


gtggaatcgg 


cctggtgctg 


gcgcgagcgc 


tggcccagcg 


51540 


ggcgcgtgct 


cggctggcgc 


tactcaccca 


ctcgcccttc 


cctccccgag 


agcagtggga 


51600 


gcagtggctg 


gaggaagccc 


cggcgcaccc 


ggagccggcg 


tggcggagcg 


aggccgaccc 


51660 


ctcggagcgc 


cgcaggaccc 


agcaccgcat 


ccgctgcctg 


ctggagctgg 


agcagctcgg 


51720 


cgccgaggtg 


caggtgtaca 


cggcggacgt 


ggcggaggag 


gccgccgtgc 


gctcggtggt 


51780 


ggagcaggtg 


cacgcccgct 


gggggaagat 


ccacggcgtg 


ctgcacgccg 


ccgccacgtt 


51840 


cgacgacggc 


gtcatccagc 


tccgcacgca 


cgagcagtcc 


tcgcgggccc 


tgcgcaccaa 


51900 


ggtgcggggc 


agcatggtgc 


tgcacgaggt 


gctggcgagc 


gaggggttgg 


attggttcgc 


51960 


cctgtgctcc 


tcgctggcgt 


cggcgctggg 


ctcattcggc 


caggcggact 


actgcgcggc 


52020 


caatgccttc 


caggatgcgt 


atgcgcacca 


cctgcgccgg 


cagggcttca 


cgggagcgct 


52080 


ggcgctggac 


tggggcacgt 


ggagagatac 


gggggcggcc 


atgcggctgg 


tggcacgcac 


52140 


ccgacggggg 


ggccatgaga 


agccgcccac 


gccgctcacc 


cacccactct 


tcgattgcga 


52200 


acagcgcgag 


ccgggcggga 


cgcactggct 


cggcctcacg 


ctgaggggtg 


gagaggactg 


52260 


ggtggtggac 


gagcaccggc 


tgcaaggggt 


gccaacactg 


cccggggtgg 


cctacctgga 


52320 


gctggcgcgg 


gcggcgtgtg 


cccaggcgct 


gggggccgag 


gcggtggagc 


tggcggagct 


52380 



WO 2004/022586 PCT/EP2003/009780 

29/38 

gttgctgctg gagcctctga cggtacctcg aggcgaatcc aggcaggtcc gggtggtgct 52440 

ccagcccgag gggcaggctc atgccctgcg ggtggagagc cggtcggagg aagcgcgggg 52500 

atggaatgag catgcgcggg gccgggtgcg cgccgtgcct cggctggccg agcgcatcca 52560 

gcccgagctg ctgcgcgccg cctgcgaaca cgagcagccc gtgcccggag agccccagga 52620 

gcaaggccct gtccacgccg gagcacgctg gcatggcctc ttccagtggg ttcgccgcgg 52680 

ccctcgccag gccctcgccc agctcgcgct accggagccc ttccacggtg acctcgagcg 52740 

cttcgagctc cacccggcgc tcatggacat ggccaccagc ttcgccattc ccggcggagt 52800 

gccctggctc gcctttggct atgagcgcgt gctcattcac ggtccgctgc caccgcaggt 52860 

gctcagccac gtgagcctgc ccgaggagtc gcaggctggc gcgcaacaac tccggctaca 52920 

ggtgcggctg ctggatctgg agggctggga gcgggtgcgg atcgatgggt acctgctgcg 5298 0 

gccgctgaag cccagcgacg ccagtgtcga gccggcggcg cccaacgtgg aagtcgccgt 53040 

ggggacgccg gggctgctgg agagccttgg cctgcgccgc tgcacccgcc ccgcccccgg 53100 

gccgcgccag gtggagatcg aggtggaggc cgctgggctc aacttcctgg acgtgctggg 53160 

cgcgctgggg atgatgccgg cattggaggc ggaggagagc gtactggggc gcgagtgctc 53220 

cggacgcatt gccgccgtgg gcgagggcgt cagcgggctg cgcgtggggg acgaggtgct 532 8 0 

ggcggtggcc ccaggctgct tccgctccta cgtgctggtg gatgagagcc aggtggtgcg 5334 0 

caggcccgcc tcgctggggc tcgccgaggg ggcggcccag atggtgccgt tcgccacggc 53400 

gtacttcgcc ctgcacaccg tgggccggtt gcggcgcggc gagcgcatcc tcatccacgc 53460 

cgcggccgga gggctgggtc tggccgccgt tcagctcgca tcccggaccg gggcggagat 5352 0 

attggccacc gcgggcagtg agcagaagcg cgagtacctg cgctcgctcg gcattgctca 53580 

cgtgctggac tcgcgcagca cctccttcgt cagcgaagtg cgcgagcgca ccggcgggcg 53640 

tggggtggat gtggtgctca actcgctggc gggagagctg ctcctggcgg gcctgtccgt 53700 

gctggccccg cacggccgct tcctggagct ggggaagagg gacctgtatg cggaccagca 53760 

ggtgggcctc cgaaccctgg cccgagggca gactttcgcc gccattgact tcggtcccca 53820 

ccacccggac ttccgagcgg tgctcgagga ggtggccacg caactcaccc agggccagct 53880 

cgagccattg cccacccgcc tcttccccgc gcggcaggtg gccgaagcct tcagcttcat 53940 

uy^eucgcg cngcacatcg gtcgcgtcgc cgtctccatg cagggggcga cggcattgcc 54000 

cgcgtccatg actcggggct ccaggcccgc accggtggcc gtacctcctt gggaggaccc 54060 

gcggctggcg ggcggcatct cctctgaaga gggtgcggag gccttcctgc gcgcgttgga 54120 

~~ **<?gc?crca ccgcagctca tcatctcccc ccaggacttc agctcgctgc tgcgcggcct 54180 

gc»y-ggca»c cagggcgtgc gcgaaaagga gcgcctcgtc accgggcgcg ccgccgccgc 54240 

-jgagccgn-g gccttgccac cctcctcgct ggagcagctc atcgagcagg tgtggcgcaa 54300 



WO 2004/022586 




PCT/EP2003/009780 



gcacctgggt 


gtggagcgcg 


tgcagcccac 


30/38 
ggacagcttc 


ttccagcttg 


gaggagactc 


54360 


gctgctgggc 


atccaagtgg 


cggcggatct 


ccgcaggcac 


ctgggtgtgg 


agctgcccac 


54420 


ggccaccctc 


ttcagccacc 


ccaccctcgc 


cgcgctggcc 


gcggctctgc 


gggctcgaca 


54480 


gggtgaggcc 


gcggctccca 


ctgcccccgc 


gccagcgctc 


gtgccggatc 


cagccgcgcg 


54540 


cttcgagcct 


ttccccctca 


cggatgtgca 


ggaagcctac 


tgggtgggtc 


gccgctcggc 


54600 


cttcgagctc 


ggcggcgtcg 


ccgcccatgg 


ctacttcgaa 


atcgaaagcc 


cggggctgga 


54660 


ggtggagcgc 


ttcatccaat 


gctggcgcca 


gctcctgcag 


cgccacgaca 


tgctgcgcat 


54720 


ggtggtgctc 


ccggatgggc 


ggcagcaggt 


gctcgagcag 


gtgccggagt 


acacgccgga 


54780 


ggtggtggag 


ctgcgggggc 


tctcccccca 


ggaggccgag 


tcccgccggc 


tccagctgcg 


54840 


tgagcgcatg 


gcccaccagg 


tcctgcgcag 


cgaccgctgg 


ccgctcttcg 


agctggtgct 


54900 


ctgccggtac 


gagggaggcg 


tccgcatcca 


catgagcatg 


gatgccctga 


tgctggatgc 


54960 


gtggagttca 


gcggtgcttc 


ggcaggactt 


cgcccagctg 


taccacgagc 


cgggccggcc 


55020 


gctggagccg 


ctggccatca 


ccttccgcga 


ctacgtgctg 


gcggagcgcc 


ggctgcgcga 


55080 


gggcgaggcc 


catgagcgcg 


cccgcgcata 


ctggtgggct 


cggctggaca 


cgctgccgcc 


55140 


accgcccgag 


ctgcccctgg 


tgaaggaacc 


ctcgcagctg 


gagcacgcgc 


ggttcaccca 


55200 


ccgcgaggct 


cggctcgagc 


cacaccgctg 


ggcccggctc 


caggagcggg 


cgcgcgccca 


55260 


cggcctcacc 


ccctcggccg 


cctgcatggc 


cgccttcgcc 


gaggtgcttg 


cccgctggag 


55320 


ccgtcacccg 


cgcttcaccc 


tcaacctcac 


cctcttccag 


cgcttgcccc 


tgcacccgca 


55380 


ggtggacgag 


ctggtgggcg 


acttcacctc 


cctggtcctg 


ctggaggtag 


aggcacacgc 


55440 


ggcgagcacc 


ttcgccgagc 


gtgcctcccg 


gctccaggca 


cagctatggc 


gggacctgga 


55500 


gcacggcagc 


gtgagcgccg 


tgcagctcat 


ccgcgagctc 


gtccgcaccg 


gccgccgctc 


55560 


cccgggcgcc 


atcatgcccg 


tcgtcttcac 


cagcatcctc 


agcctggatg 


cgcggcgcgg 


55620 


cccccagggc 


agcctctcct 


tcttcgaggg 


agaactggtg 


tacagcatca 


gccagactcc 


55680 


ccaggtgtgg 


ctggaccacg 


gagtccacga 


ggaggagggg 


gcgctcgtcc 


tggcgtggga 


55740 


ctcggtggag 


gcgctcttcc 


ctccgggcat 


ggtggacgac 


atgttccacg 


cctaccagcg 


55800 


gctgctgggg 


gcactcgccg 


aggaggagca 


agcgtgggag 


ggcgagctgc 


cggagctgct 


55860 


gcctcctgcc 


cagcgtgagt 


tgctcgcgcg 


ctacaacgcc 


acccaggcgc 


cgcggcccag 


55920 


cggacggctg 


gaggagggct 


tcttcaccca 


ggcgcggctc 


caccccgagc 


tccccgcgct 


55980 


gctcgcaccg 


gagcgcaccc 


tgagctacgg 


cgagctggca 


aggcgagccc 


aggcgctggc 


56040 


cgcgcgccta 


cgcgagctgg 


a ggtgcagcc 


tcaggagttg 


gtggccattg 


ccatgcacaa 


56100 


gggctgggag 


caggccaccg 


ccgtgctcgg 


cgtgctgcag 


gcggccgcgg 


cctacctgcc 


56160 
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guuggatcca 


gagcagccgc cgctccggct 


ccaccagctc 


ctggaggagg 


ggcccgcccg 


56220 


cgtggtgctc 


acccagtcct ccttgctgca 


caccgtgccc 


tggccgccgg 


gtgtgcaggt 


56280 


gatcgccgtg 


gacgaactcg agcctgcaac 


cgaggccccc 


ccgctaccgc 


ctcgcggtac 


56340 


accggagcac 


ctcgcctacg tcatctatac 


ctccggctcc 


accggcaaac 


ccaagggtgt 


56400 


ggccattgaa 


caccgggccg cgctcaacac 


cgtggtggac 


ctcaacaccc 


gctttggcgt 


56460 


cggcccggag 


gatcgggtgc tgggcctgtc 


cgcgctcacc 


ttcgacctgt 


cggtgtacga 


56520 


cgtgctgggg 


ctgctcggcg cgggcggagc 


gctggtgctg 


cccgcggcgg 


aggcggagaa 


56580 


ggatcccgcc 


cactggtggg agcggctggt 


ggctgggcgg 


gtcacggtgt 


ggaactccac 


56640 


cccggcgctg 


atgctgctgc tggtggagta 


cgccgagcag 


cgcgggctga 


agctgcccgc 


56700 


ggcgctgcgc 


ctggtcatgc tcagcggcga 


ctggatcccc 


gtggcgctac 


cggatcgcat 


56760 


ccgcgcactc 


ggcagggacg tgcaggtggt 


gagcctggga 


ggcgccaccg 


aggcttccat 


56820 


ctggtccatt 


gcctacccca ttggccaggt 


ggcaccgcag 


tggaagagca 


ttccctatgg 


56880 


catgccgctg gccaatcagc gattccacgt 


gctggacggg 


cggctggagg 


cacggccctg 


56940 


gtgggtgccg ggcgagctgt acattggcgg 


cgaggggctg 


gcacgcgagt 


actggaggga 


57000 


tgagccgctc 


accgcaacgc gcttcatccg 


tcacccgcgc 


acaggcgaac 


ggctgtaccg 


57060 


caccggggac 


caagggcgca tgctgcccga 


agggagcatc 


gagttcctcg 


gccgggagga 


57120 


tctgcaggtc 


aaggtgcagg gcttccgcgt 


ggagctgggg 


gagattgaag 


cggcgctggc 


57180 


ccagcacccg 


gcgctatcgg cgagcgtggt 


ggtggcacga 


ggagagcctc 


ggggagtgcg 


57240 


gcggctggtg gcctacgcgg ttcctcgctc 


gggtcagacg 


cccgcggcag 


gcgagctgcg 


57300 


ccgctacctg 


gcggagcggc tacccgcgta 


catggtgccc 


tccgccttcg 


tcctcctgga 


57360 


gtccctgccc 


cgcagccgca acgggaagat 


cgcccgagat 


cagctccccg 


agccgcagca 


57420 


gacacaggga 


ctggctgcac aggccgcggc 


ggccgacccg 


ctcgtqaaac 


ggctggcagc 


57480 


cctggtgaaa 


gaggcgctga ggctcgagcg 


cgtggagccc 


caggacagcc 


tgctggacct 


57540 


gggggcggac 


tcggtggcgc tcatccgcct 


catcaaccgc 


ctggaggcag 


agctgcagtt 


57600 


ccgcccgcga 


ctggccgaca tctacgagaa 


ccccaccgtg 


caggggctcg 


ccacgctcca 


57660 


ccaagagaag 


acaaagagcc agggagaggg 


aggcgctccg 


cggctcacgg 


cgccccgctc 


57720 


cacgcttctg 


cccgccgagg agtgggggcg 


cttcaaggcc 


aaccgcccgg 


gcctgcgccg 


57780 


cttccccgat 


ggaacgcctg aggtggcact 


gccaggcagc 


ggcttggcgc 


cggcccccga 


57840 


ggagctgaca 


gctctcgagc gccgccgcag tgtgcgcacc 


tactccctcg 


agcccgtgag 


57900 


ccacgagcag 


ttgggacggc tgctggcccc 


cttgcgcgag tgggaggtgc 


agggctcgcg 


57960 


gcgttacctc 


tacgcctccg cgggcgggct 


ctacccggtg 


cagctctacc 


tccacctcaa 


58020 


ycucyy-cgg 


gcgcgagggc tcgagcccgg 


cacctggtat 


tacgatccca 


gcacccatcg 


58080 
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gctggtgctc 


ctgtccgccg 


gagccgggct 


ggatcgccgc atccatgatc 


cgcaccagaa 


58140 


ccaggccatc 


ttcgactcgg 


ccgcgttctc 


cctcttcctc 


atcgcccgca 


tgggggccgt 


58200 


ggagccggta 


tacgccgaac 


atgccctgca 


cttcgccacg 


ctggaggcag 


ggctgatgac 


58260 


ccagcttctc 


gatctgggcg 


cggctcccag 


tgggctgggg 


ctgtgccata 


tcggcgacct 


58320 


ggacttcgcc 


caggcgcggg 


ggctcttcca 


cttggaggag 


gagcacgtgc 


tgctgcacag 


58380 


cctggtgggc 


ggagtactcc 


ccactcgggg 


gcaggaggcg 


gcctcggtgc 


ccgctgaggg 


58440 


aggaacggag 


gcgcggcaac 


tggcgcagct 


cctgcaacag 


gtgaagacgc 


tcacgcccga 


58500 


ggccgcgcga 


gcgctgctcg 


aggcccgccg 


cggcagcaag 


gggagaccac 


atgagtaagc 


58560 


tgcacgagga 


gctggagagc 


ctggcgccgg 


agcagcggga 


gctactcgcg 


gccctgatga 


58620 


aagagcaggg 


cctggacgag 


ggcgcattgc 


tgatgcccgt 


ggagcgcaag 


cccgaggggc 


58680 


tgccgctctc 


gtccgcccag 


cagcgcatgt 


ggttcctgca 


acagctcagg 


cctacgagcc 


58740 


ccttctacaa 


cgtgcacgcg 


gcgctccggc 


tcacgggaaa 


gctgaaggtg 


gagtgcctgg 


58800 


tgcacagcct 


gaatgagttc 


gtccggcgtc 


acgagccgct 


gcgcacggtg 


ttcccctccg 


58860 


ccggaggtca 


acccctgcag 


cgcatcctcg 


cgcccgcacc 


ggctgcgctc 


gagcagcgcg 


58920 


acctgagcgg 


cgtgcccgcc 


caggagcgag 


aggccgaggt 


gtaccgcgcg 


gtggagcacg 


58980 


cagcgctcgc 


ctcctttgat 


ctggagcgag 


agccgccgtg 


ccgcttcctc 


ctggtgcgag 


59040 


tggagccgga 


ggaacacgtg 


ctggtgttcg 


ccacgcacca 


catcgcagcg 


gatggctggt 


59100 


cgctcggcgt 


cttcgtgcag gagctgtgcg 


cgctgtacac 


ggctgctgta 


cacgcggagc 


59160 


caccggcgct 


gccaccgctg 


cggctgcagt 


acgcggactt 


cgccgcgtgg 


gagcggagcc 


59220 


ggctcaaggg 


tgggcgcgag 


cgcgagctgc 


tcgagtactg 


gcaggagcag 


ctggcagggc 


59280 


tgccggacct 


gagcaccctg 


ccgccagagc 


ggcccaggcc 


cccgctgtcc 


aagcaggagg 


59340 


gcgccacgtt 


cgaattcgcc 


ctgccaccac 


accaggtgca 


ggccctgcgc 


tcacttgctc 


59400 


aagcgcggcg 


cacctcgctc 


ttctccgtgc 


tgctcgccgc 


cttccaatgg 


ctgctggccc 


59460 


ggtgcgccgg 


ccaggacgac 


gtggccctgg 


gcatgcccat 


cgccaaccgc 


aaccgcaagg 


59520 


agctggaagg 


cctcatcggc 


tgcttcgcca 


gtaccctggt 


gctgcgcgcg 


aagccctccg 


59580 


cctcgaccgc 


cttcacctca 


tggctcgccc 


aggtgtccga 


gcagctccac 


ggcgccctcg 


59640 


agcaccagga 


ggtgcctttc 


gaacgattgg 


tggaggtgct 


ccagccccgg 


cgccgcatgg 


59700 




gctcttccag 


atcttcctcg 


ccatgcagca 


gcacccgctg 


cggcgcgccg 


/ DU 


agctgccagg 


cctgctgctc 


tcggagttcc 


cccttcgcag 


ccgcgtggcc 


cgcttcgatc 


59820 


tggagttcca 


cctctgggag 


tctgcccagg 


gcgtcgaggg 


caccctcatc 


tacgacgtgg 


59880 


acctgtatgg 


ccaggagggc 


gtagcgcaac 


tggtgcggcg 


ctacgtgagc 


ctgctcgaag 


59940 
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ccgtggcggc ggacccgacg cggacactgg gggagctggc gggtgagacg ctggccccgc 60000 

ccgtacggaa gcaggttctc gccctgtcgg agtgtccgcc cctcccacgc ccgagcacag 60060 

cccctcggac actggccgag gcacttctac agaccgccga gcgcttcccc acggccacgg 60120 

tcagcttcgt ccaggcacag ggctcttgca cggcctggac tctgccagag ctggtggagc 60180 

gggcccggcg cctgcaggcc ggcttgcggc agtggggatt gcggccgggc gacagtctgg 60240 

tgctggtgct ggggagggag gaagagacgg tagaggcact ctgggcctgc gtgctggcgg 60300 

gagtggcgcc cctggtgcta ccggcgccgc ccgcgagagc ggaggcgagt cccgcgctct 60360 

cacgtctgcg gcatgcccgg cagctgctgg gtgggccccg ggtcctcacc cggcaggaga 60420 

tgctgcccga tcttgccagg cagcttcagg tctcgcccac cgccgacatc ctgggggccg 60480 

tggaggagct gagagccacg ggtggcgagg cgcccctgcc cccagggcgc atggatgacg 6054 0 

tggccctgct caacctcacc tcgggcacca ccggcaaggc caagtgcgcc atgctcacgc 60600 

accgcaacct gctggtgcgc ctggaggcca ccaacgtcgt ctacgagtcc cagccgctcg 60660 

agcgcggcct cgtctggttg cagctgcaca acattggcgc cctgtccgag taccacctcc 60720 

gcccgctctg cgccggcatg cacaccttcc acgctcccac cgaggaggtg ctcgccgagc 60780 

ccctgcgatg gctggagtgg ctcgagcggt atggcattgc acagacgtgg gcgcccagct 60840 

tcgcctacag ccacctgctg gagcggctgc gcaaggtgga ggaccggcgc tggagcctgg 60900 

gaggcgtgcg cgtgctgctc agcgccggag agcagatctc cgcgcccatg gtggaggagc 60960 

tgatgcggag gttggctccc tcgggagtgc gggaggatgc cttcgtcgct gcctggggaa 61020 

tgaccgagac ggcttcgggg gtgacctatg cgcggcggcc cggaacgccg ccccgcatgc 61080 

acaccctgga gcgcgcgagc ttgagcggtc cgcttcggca cgcggcaccc gcttcgccca 61140 

ccgcgctgcg gctgatggat gtaggagccc ccattgccgg caccgccctg cgggtggtgg 61200 

acgcgagcgg cgagctgctc tccgaggagt gcgtggggcg cattcaagtg cgcggcgaga 612 60 

tgatctctcc cggctactac ggggacccga aggcctcggc ggcgctgctc accgccgacg 61320 

gctggctgga gacaggcgat ctgggcttcc tctccgaggg agcgctgacc attaccggcc 61380 

gcgccaagga tctggtcatc atccacggca ccaacttctc ctgctacgag atcgagtccg 6144 0 

cggtggagca ggtggaggga gtggcgccct cctcggcggc ggcggcggcg gtgcggatgc 61500 

tcgaggggag cagggaggag ctggcggtct tcttcgtgcc cacggaggga ctcgctccgc 61560 

aaccccctgc ctcactcctg tcgcgcatcc gccagcaggt gctcgagcag gtgggggtgc 61620 

ggatcgatca tctcatcccg ctcgagcctc accagctccc acgcacggag ggaggcaagc 6168 0 

tgcgccgctc ggagctgagg gcgcgcttcg aagcaggtga gctgagggcg ccgcaacccg 61740 
ctccggtccc gagcccctcg cgcccgctgg agcagctcat cgcctccgtc tgggcggagg 61800 
tgctcgagca ccaggacatc gcgcccgagg ccagcttctt cgacctgggc ggcaactcca 61860 
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tcctcctggt gcgcgtggag cgggcactgc gcgcgcggct cggcctggag ctgacgctca 61920 
tggacctgtt cgcctacccc acggtgcact cgctggcgga ctacctggag cctcgagccg 61980 
cccagctgcc agcacaggcc tcccctccca cgcaggccga gcgccggcgg gggatgcgcg 62040 
gcgcggcgct ggagcagaga cgcactcgcc gccaggcaca acgggacgag gactagacac 62100 
gcatgagcca cgacacgcag ctcgagggag ccgtagcggt agtggccatg gcggggcgct 62160 
tcccccaggc ccccaccctg gaggactact ggcgcatcat ccacgagggc gtggacgcct 62220 
ttacccagct cacggacgaa cagctcctgg cctccggagt ggggccctcg ctcctgcgcc 622 8 0 
aacccggtta cgtgcggcgc gccgccctgc tggacaggat ggaccagttc gacgccgcgt 62340 
tctttggctt ctccccgcga gaggcggagg tgctcgaccc acagcaccgg ctcttcctgg 624 00 
agtgcgcgca cgaggcgctg gagcgggccg gccatggctc cgagcgcgca agggggcgcg 62460 
tgggggtctt cgccagcgcc agcctcaaca gctactacct gcattccctg cacgggaacg 62520 
cccggctgag ggaggtgctg ggagacttcc agctcgccat cgccaacgac aaggacttcc 62580 
tgcccacccg cgtctcgtac aagctgggcc tgcgcgggcc cagcgtggcg gtgcagacgg 62640 
catgctccag ctcgctggtg gccgtccacc tcgcgtgcca gagcctgctc aatggtgagt 62700 
gcgccctggc gctcgcgggc ggctcctctc tctccgtgcc ccaggcccag ggctacctct 62760 
accaggaggg aggcattgcc tcaccggacg ggtactgccg ccccttcgac gcggcggcgg 62820 
cgggcaccaa ccgaggcaac ggcgtggggg tggtgctgct caagctgctc gaggaggcgc 62880 
tggcggacgg agacaccatc cacgccgtca tccgcggctc ggcggtgaac aacgacgggg 6294 0 
cacacaagat cggctacacc gcgccaagcg tggaggggca ggcctcggtc atctccgagg 63000 
cgctggaggt ggcggccgtc tccgcggaca gcatcggcta cgtggaggca cacggcaccg 63060 
ccacggccct gggagatccg atcgaggtcc aggccctcac gcgggccttc cgcaagcaca 63120 
cggagcgccg cggctactgc gctctcggct cggtcaaggc caacatcgga cacctggatg 63180 
cggcagcggg catcgccagc ctgatcaagg ccgtgctggt gctggagcgc cgccagctcc 6324 0 
cgccatgccc gcacttcacc tcccccaacc cacgcatcga cttcgagcgc agccccttct 63300 
acgtgagcgg gcgggggcag ccgtgggagc cggtggacgg cccgcggcgc gcgggcgtca 63360 
gctccttcgg tattggtggc accaacgtgc atgtggtgct ggaggaggcg ccgccccggc 63420 
cggcgccgcg gcgggcagcg cggccctacc agctcctgcc gctgtcagca cgcacggagc 63480 
gcgcccggga ggaggtcgag gcccggctgc gcgagcacct gcgccagcag ccccaggagc 63540 
cgctggagga ggtggcccac acgctccagg taggccgccg ccacatggcc tggcgcacag 63600 
cgctggtgag cagcagccct gcccaggcgg tagagctgct ggagaggcgc cgacccgagg 63660 
cgctgctgcg tggccagagt gcggcccagg cccgctccgt ggccttcctt ctaccgggcc 63720 
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agggctcaca 


gtacgtgggc 


atgggagcgg 


cgctccatga 


gtccgaggcg 


cccttccgtg 


63780 


aacaggtgga 


tctctgtgcc 


gggaaactcc 


agggcgtgct 


cgggttggac 


gtgcgccgcc 


63840 


tgctgtaccc tggcccgcgg 


gagcgggaat 


gggcccagga 


gcgcctgcgg 


gagacgcggg 


63900 


tatgccagcc ggtgctcttc 


accgtggagt 


acgcgctggc 


ccgactgctg 


gaggcctggg 


63960 


ggatacgccc 


caccgcgctg 


ttgggccaca 


gcctgggcga 


atatgtggcc gcgtgcctcg 


64020 


cgggcgtgtt 


ctccctggaa 


gaggcgctgg 


aggtggtggc 


cgcacgtgga 


cagctcatgg 


64080 


gcagcatgcc 


cccggggggc 


atgctcgccg 


tggggctccc 


agcccaggaa 


gtggaaccgc 


64140 


tcctacccgc' ggcactggcc 


ctggccgcgc 


acaacagccc 


ccaggcctgt 


gtggtggcgg 


64200 


gccccaccga 


gcccctggcc 


gagctgcgcg 


cacgactcga 


gcagcggggc 


atcgcgtgca 


64260 


cgcctctggc 


tgtctctcac 


gccttccact 


cgccgatgat 


ggagccggcg 


gtggccccat 


64320 


tcgtcgcccg 


gctgcgccgg 


atgcatctgc 


ggcctccgag 


cctgcccttc atctccaatg 


64380 


tgactggcac 


ctggatagag 


gccgaggagg 


cgacctcacc 


ggagtactgg 


ggccggcacc 


64440 


tgctgcagcc 


ggtgcgcttt 


gcccaggggc 


tggagcggtt 


gtgcgagggt 


gtgcagccgc 


64500 


ggctgctgtt 


ggaagtgggg 


ccgggccaca 


cactggggcg 


gctggcggcg 


cgccaaagct 


64560 


ccggccccgt 


gcaggtggtg 


tccacgctgg gctccagccg 


ggaggaaagc 


tcggaggtgg 


64620 


agcggctgct 


gacggcaatc 


gggcgactgt 


gggtggaggg 


agccgaggtg 


gactgggctg 


64680 


ggctccaccg 


cggcgaacgc 


cggaggcgcg 


cggtgctacc 


cacctaccct 


tttgagcacc 


64740 


agcgctactg 


ggtggagaca 


gccccccagc 


cttcccctcc 


cgagcggcct 


gggacacctg 


64800 


ccgagagccc 


tgtcaccagc 


agcttctacg 


tgccaggttg 


gagccgcgcg 


gctctgccct 


64860 


cggcggcccc 


ctcgagacgg 


gtgggccggc 


tgctgttgtt 


ggcggaggcc 


catgggtgga 


64920 


gccaggggct 


cgcggagcgg 


ctccgggccc 


ggggacactc 


cgtcaccctc 


gtcgagccag 


64980 


gtgagcggct 


cgagcgcctg 


accccggagc 


actggcgcct 


gcccccgggc 


cgccgcgagg 


65040 


acttccaacg 


cctgctggag gactccggag agctgccctc 


gagggtgctc 


cacctctggc 


65100 


tgctggggag 


cagccagagc 


ccccaggagc 


gcggcttcta 


caccctgctg 


gcgctggccc 


65160 


aggcactggg 


agcccacggc 


acccggcccg 


cggtggagct 


cacggtggtg 


accgatcagc 


65220 


v ~~acgcggu 


ggcagagggc 


gagccgatac agccgctcaa 


ggcgctcctg 


cagggccctg 


65280 


cttccgtgct 


tccccaggaa 


ctccccgggt 


gcacgtgccg 


cctcgtggac 


atggccctgc 


65340 


cgccaggcgg 


cgtggcggag 


gagctgctgg 


agcggctggt 


ggcggaggtg 


gagtccaccg 


65400 


cqtccqaacg 


ctcggtggcg 


taccgcggcg 


cggctcgctg 


ggtgcgcgag 


ttcgtccccg 


65460 






ccttcgcagc 


tcccgcttcg 


gcagagaggg 


gtgtatttga 


65520 


■- — - — .-f .-r 


cctggggggc 


gtagggctcg 


cgctggcaga 


gcacctggcg 


cggcgcgtgt 


65580 


cggcgcggct 


ggtgctcacg ggccgctcgc ccacgccacc gcgcgagagc 


tggagcgcat 


65640 
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ggctgggcac 


tcccacgcgg 


ctgcggctct 


cacaggagct 


caaqtqqctq 


cqcqqfcgcco 

3 ^ 3 3 3 3 


65700 


ccgagcagat 


cgagcaccag 


cggcccctgc actccctggc 


cgacagtccc 


oacctqqaaa 


65760 


agtcgctgcg 


cctgctgtgc 


gccagctacc 


tctaccactt 


tctcttccct 


ctt caggege 


65820 


ctctgagact 


tggagagccg 


agggcgatgc 


aggcattgcg 


tqaqcqcctq 


qqactqcaqc 

3 3 3^ 3 w ^ 3 ^ 


65880 


ccggcttcga 


gcgcctcttc 


tccttcatga 


tcggcaccct 




aagctgattc 


65940 


gagtgggaga 


gggcatgctc 


gagggtcagg 


tggagccagc 


ccacgtgccc 


acacctcgag 


66000 


cactgcacga 


gcgactgctc 


gaaggctacc 


cggaggtgtc 


aqqcctqctq 


qaoctactto 

3^3 3^ ^ *- 3 


66060 


agcattgtgt 


gcgccactac 


cctgaggtgc 


tccgagggag 


cctgccggcc 


ctcagtgtgc 


66120 


tctacccctc 


gggccgctcc 


gaggaggact 


ccgcacgaag 


cqqqq tcqaq 


tggagcagca 


66180 


tggggcagtg 


cgtggcgctg 


ctgcggcatt 


tcctcgcccg 


ccafccrcccracr 


en r~>?\ rrr r*a rtrr 
u q. y v_ ciy y 


6 6? 4 0 


ggcgtaccct 


gcgcatcctc 


gaggtgggcg 


gcggcagtgg 


cataatactc 


cacfcrccctcfc 


66300 


tgccgctgct 


gcggcaccac 


ccggtggagt 


accacttcac 


ggatattggc 


ccctccttpft 


66*360 


tccgcgccat 


ggagggagtc 


ggccgccgcc 


agggcctcac 


cttcctccac 


acct cggtgc 


66420 


tggacatctc 


ccgccctccc 


ccagagcagg 


gccatccgcc 


tggctegtat 


aa tctcrcrt era 


66480 


ttgccctcaa 


cgtggtgcac 


gccaccccgc 


gcgttcctca 


gtccctggcc 


cacctccracfa 

*np** V*4 S-* » U u ci 


66540 


gcctgctcgt 


cccaggcggc 


cacctgtgcc 


tggtggagac 


qqtqaaacaa 


ca crccc t cfcrcr 

y \^ ' w y \*4 y 


66600 

w U U V V 


tggacatgat 


ctggggtttg 


gccgaggggt 


ggtggagcta 


caaoaacaaa 
w y a yy wu y a y 


cfccrccf ca err 


66660 

U U U u u 


gctctccctt 


gctggaggta 


ggcgactggg 


agcacgccct 


qcqcaacata 
y , »* 3 j 3 y 


cr cr c 1 t~ t" nr r* rr 


667?o 


aggtggaggt 


gctgccagcg 


gccgtggagc 


agcgctcacg 


ctgggacaac 


crfccrcfcccfcc^ 


66780 


ttgcccagcg 


gcccggcgag 


agcgggcttc 


agccagtggg 


cqgqcqcqca 


gecatgeagg 


66840 


agcggatccg 


ccggctgcgg 


gccattgaag 


aagcgggagg 


cqaqqtqcta 

3 w y w k*y 


cccctcrcrtcrcr 

v-> \_< v-. v» y y ^"33 


66900 


cggacgtcac 


cgatcgggag 


cgcatgggcg aggtgctcgc 


cqaqataaaa 




66960 

\J \J Zs \J \J 


gggcgctcca 


tggggtgatc 


cacgccgcgc tggtgctgga 


ggatqqgctq 

3 3 ^" 3 3 3 3 


atgeaget ca 


67020 


agacgcgcga 


gtcggccggg 


cgcgtgctgg cctccaaggt 


cqaqqqcacc 


ctcrcrtcfctacr 


67080 


acgaactgct 


gcgcgacgag 


ccactcgact 


tcttcgtcct 


gtgctcctcg 


ctgagegcat 


67140 


tgctcggcgc 


gctggggcag 


gcggactatg 


ccgccgccag 


tgccttcctc 


gaegegta tg 


67200 


cgcactcaca 


gcgagggcgc 


acggaccgcc 


gcaccatctc 


cataqactaa 


apt +■ /-^ i-r /-> +- rrrrr" 
y a l. y L-^yy l 


67960 


t or or a cr or t cr cr cr 


cq c co cca trr 


cgcctggggc 


tggggctggc 


ggegggagea 


ctggggctgc 


67320 


agcgcacggc 


acccggagag 


tacacggtgc 


gatggcaggc 


cgagcgctgc 


tggtggctgg 


67380 


acgagcaccg 


cctggaagga 


cgggccacgc tgccgggggt 


ggcgtacctg 


gagctggtgc 


67440 


gagccgccct 


ggtccaggaa 


ctgggcgagg 


cccccgtgga 


gctggagcaa 


ctggtgctgc 


67500 
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ggaagcgccc gcgggggagg 
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aggtggaggt 


gcgattccac 


ctgcgaccgg 


67560 


aggatgaggg 


ctatgcgctg 


gagatccgca 


gccgggctgg 


cggcctcgcg 


aatgggggct 


67620 


ggcgcagcca 


tgcgatgggc 


cgggtgagga 


tgctgcctcg 


cgggagcgct 


cgacccaccc 


67680 


acccgctacg 


ggagctggag 


gaacggctcg 


gcttgaccca 


ggcgccgagg 


gagcatgagc 


67740 


cagctcaggg 


cccccgcgag 


ctggcacctg 


ccctgggccg 


gcgctggtcc 


tcgctgagct 


67800 


ggcaccgcgc 


gtggaaggga 


gaagagggcc 


tggccttgat 


cgagctcccc 


gaggagctgg 


67860 


ccgaggatct 


gcggcaatgg 


cccctgcacc 


ctgccctgct 


ggacgcggcg 


accgggtttg 


67920 


caccgctgcc 


ccccggtgcc 


tggctgccgc 


tgtcctatgg 


caacacccgc 


atccacggcc 


67980 


cgctgccgcg 


gcagctgtac 


agccacatcc 


gccggctgga 


gcccgccagt 


gcccaggccg 


68040 


gagtggtgcg 


gctggaggtg 


cgcctcatgg 


acggcgaagg 


gcgggagctg 


ctgtgcgtgg 


68100 


aggagttcgt 


gctccgccgg 


gtggaggtgg 


actcgttggc 


tcggccacag 


cccgctggca 


68160 


aggctccgcg 


ccaggcagtg 


cttccacgac 


caggcgcgct 


cgactcgctg 


cgactgcaac 


68220 


ccctggagcg 


ccttccgccg 


caggagggag 


aggtggaggt 


gcaggtgctc 


gccgcgggcc 


68280 


tcaatttcaa 


ggacgcgctg 


ctggcgctgg 


gggcgttgcc 


agtggagctg 


gccaacggcg 


68340 


cccca ctggc 


gctgggagtc 


gagtgcgcgg 


gcatcatctc 


cgcggtaggt 


ccaggagtga 


68400 


gaggcctgcg 


agtgggcgag 


gcggtggtgg 


ccgcggcagc 


aggagccttc 


gcctcgcatg 


68460 


tgcgagtccc 


ccaggagcag 


gtcttcccca 


agccagcggg 


cctgagcttc 


gagcaggcgg 


68520 


cgatggtgcc 


cgtcactctg 


ttcaccgcct 


ggtatgccct 


ggaggagctg 


gcgcggctgc 


68580 


gcgcgggaga 


gcacgtcctc 


atccacgccg 


cggccacggg 


ggtgggactg 


gcgggggtga 


68640 


agttggcgct 


gcggcggggc 


gccacggtgt 


acgccaccgc 


gggcagcgag 


ccgaagcgcg 


68700 


agctgctgcg 


ctccctgggg 


gtaacgctgg 


ccatggactc 


gcgcgcgccc 


gggtttgacg 


68760 


accagcttct 


gcaacacacc 


caggggcgag 


gcgtggacgt 


ggtgctcaac 


tcgctgagcg 


68820 


gcgagttcct 


ctcccgcagc 


ctgggcgtgc 


tcgctcgcca 


cgggcgcttc 


gtcgagctgg 


68880 


gtgtaaggga 


catgctctct 


ggcggcacgc 


tgccgctggc 


tcccttcgag 


cgcgggctca 


68940 


ccttcctcgc 


ggcgcagata 


gacccgggga 


tgaagggcta 


ccgggagctg 


atgggcgagg 


69000 


ccctgcggca 


gatcgagcgc 


ggagagctgg 


agccgctgcc 


ctacacggca 


tggccactgg 


69060 


agcgcgtggc 


cgaggcgctc 


cagctcgtct 


ccaagggcag 


gcacgtaggc 


aaggtggtgc 


69120 


tcaccccgga 


agagcccttg 


tcacggccgc 


cagcgctccc 


gagctcctca 


aggccagtgc 


69180 


aaccccgggc 


aggcgtggcg 


attgtcgggc 


tgcgctccgc 


cgagggctgc 


gaggccttcg 


69240 


a gcgca tcct; 


cgcggccggc 


ctgccgcagg 


tggccgtctc 


cacccgtgag 


ctgcgcgcgc 


69300 


gcatggcgga 


gatcgagcgc 


ctgcgcgtct 


cctcctggga 


gccggcgctt 


ccctcggtgc 


69360 


cccgcggctc 


ccccgccagg 


accgaacgct 


ccaagcccta 


tgttgctcct 


cgcaccgagc 


69420 
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gcgagcgcgc 


cctggccgcg 


ctctggcagg 


acctgctcgg 


cgtctccgag 


gtggggttgg 


69480 


atgatgactt 


cttcgagctg 


aaaggggact 


cgctgctcgc 


catccagctc 


atggggcggt 


69540 


tgcgcaagga 


gctggaggtg 


gaattgccac 


tctcctcctt 


cctcgcccgc 


cctacgctgc 


69600 


gcacactgct 


ctccgccatg 


ccagccccgg 


caacggacga 


gcctgcctcg 


gccaccgggg 


69660 


caatcccagc 


cgccgctccc 


cagccgtcac 


cggctgtctc 


cagcgaaccc 


gctcggaagt 


69720 


ggcgccacct 


ggtgcccatc 


caacctcagg gcgagagccc tcccttcttc 


tgggctgcgc 


69780 


cgctgatggg 


caccgtgttt 


ccctacttca 


cgctggcccg 


ctgcctggcg 


cccacccacc 


69840 


ccttctatgc 


tctccagccg 


ccagggttgc 


aggaggggca 


ggagcccatg 


gacaaggtgg 


69900 


aggaactggc 


ggccctgtac 


gtgcgcgagc tgagagagct 


gcagccgcga 


gggccctacc 


69960 


gcctgggagg 


ctggtccttc 


ggctgcgtgg 


tggcttatga 


ggttgcccta 


cagctcgagg 


70020 


cggctggaga 


gcaggtggcg 


ctgctgagcc 


tgctggactt 


ccctgcccct 


tcgggccagc 


70080 


gagccagtct 


ctgggcgacg 


gcccgcttct 


tcagcggctc 


ggtgctgcgc 


ggcctggcgc 


70140 


cctatacgct 


cgactatctc 


tacctggcgg 


ctcgcgctcc 


tctgaatcca 


gacgctgcct 


70200 


cgcgcctgag 


cagctccctg 


acccgaggat 


gggagcgcct 


gcggaaagga 


gggctggtgc 


70260 


aggaactgct 


ggaccatgcc 


gccatggccc 


gggtggtgcc 


tgaggactca 


cggctgctgc 


70320 


tgctgcgaga 


gcctggcatt 


gcccccatgt 


tgaggctgac 


ccgggctcac 


cagcgagcca 


70380 


tgctctccta 


tcggccccga 


ggtcgcctcc 


gccagcgcat 


cgtcctcttc 


cgcacctccg 


70440 


agcaggcctg 


gagcttcgcg 


cgagacctgg 


gctggggagc 


cctcagcgcc 


tcgggcgtgg 


70500 


atgtgcgcga 


ggcgccggga 


gaccatatga 


cgctgctgcg 


gccacctcac 


gtggagcggg 


70560 


tggcagagca 


gctccgggcc 


ctgctcagct 


ccgcgcctct 


cccgactcgc 




i U D/i u 


gggctgcgcg 


ccagaggctg 


cctcatgcat 


gccctttggc 


agccagcccg 


agctgtccac 


70680 


ccactccagc 


gcctcgctca 


gatcgtgctt 


ttcgacgatc 


tgatccaagg 


tgaccacctg 


70740 


cgcttccacg 


ctgttgccca 


gatgagaggg 


cacccgcgga 


tc 




70782 



WO 2004/022586 j^M 

SEQUENCE LISTING 



PCT/EP2003/009780 



<110> Gesellschaft fuer Biotechnologische Forschung mbH 

<120> Tubulysin-Biosynthesegene 

<130> 13289 

<150> DE 102 41 152.7 
<151> 2002-09-05 

<160> 46 

<17 0> Patentln version 3.1 

<210> 1 

<211> 82868 

<212> DNA 

<213> Angiococcus discif ormis 

<220> 

<221> CDS 

<222> (5719) . . (7164) 

<223> ORF14 

<220> 

<221> CDS 

<222> (12841) . . (13881) 

<223> ORF11 



<220> 



WO 2004/022586 

<221> CDS 




PCT7EP2003/009780 



<222> (14942) . . (15586) 
<223> ORF9 

<220> 

<221> CDS 

<222> (15847) . . (16983) 

<223> ORF8 

<220> 

<221> CDS 

<222> (22366) . . (23532) 

<223> ORF6 

<220> 

<221> CDS 

<222> (24591) . . (26513) 

<223> ORF5 

<220> 

<221> CDS 

<222> (26597) . . (27517) 

<223> ORF4 

<220> 

<221> CDS 

<222> (29858) . . (30400) 

<223> ORF3 



<220> 

<221> CDS 



WO 2004/022586 




<222> (31220) . . (32392) 

<223> tubA 

<220> 

<221> CDS 

<222> (35522) . . (40147) 

<223> tubB 

<220> 

<221> CDS 

<222> (48011) . . (58558) 

<223> tubD 

<220> 

<221> CDS 

<222> (62103) . . (70616) 

<223> tubF 

<220> 

<221> CDS 

<222> (75488) . . (76645) 

<223> ORF20 

<220> 

<221> CDS 

<222> (76787) . . (77545) 

<223> tubG 

<220> 

?22Z> CDS 

^22> (77769) . . (78695) 

<223> ORF21 



PCT/EP2003/009780 



WO 2004/022586 




PCT/EP2003/009780 



<220> 

<221> CDS 

<222> (79138) . . (80019) 

<223> ORF22 



<400> 1 
gcggccgcct 


tcttcaccgc 


gggggcctcg 


gcctcctcgg 


tggtcggggc 


gggcgccagc 


60 


ttgtcgccgt 


gcaccttgcc 


gggagcggtc 


gccttcttct 


tcgaggcgga 


gcgcacgggc 


120 


tcgccctgcg 


tcacctcgcg 


cgtgggcagc 


atgtccccgt 


gggccttcac 


cgggccggac 


180 


tcaccctcct 


ccgcctcggc 


gccctcggcc 


ggcgtctcgc 


gggtgtcgcc 


accctccttg 


240 


gcccggttga 


actcgcgcat 


cgcgtcgtcc 


accagcccca 


tggtcatgta 


cgccacgccg 


300 


aggtcatggt 


ggtcggatgg 


cgacaggccc 


tccttcgtgc 


cctcgcgcag 


cttcgcgagc 


360 


gcgtcctgca 


cctgcgtgtc 


cccggcctcg 


gacgcggtgc 


cggtctcgcc 
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ttcagctcct 


ggcccaggtc 


cacgccgccc 


ttctcttcct 
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tgggccgggg 


cggaggggga 


ggcaccctct 


tccgcggtgg 


gggtgctgct 


ctcgggagtg 
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gtgctctcgg 


acggcggcgt 


ctctggcatg 


gcgggctccg 
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ctgcaggtgg 
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tcctgaagct 


tggccttgcg 


ctccttcagc 


tcctcgacgc 


gggcgcggtc 


cttctccacc 
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gcgccttggc 


cacgaagttg 


gggttctcca 


gcttgcgcag 
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atctcctgct 


cggcgcgggc 


aatctccttc 


ttcaaccgct 


cgcgctccgc 


gtccaggtcg 


780 


acgaggcccg 


ccagcgggac 


gtagatctcc 


aggttggagc 


ccacgaaggc 


ggcggcctgc 


840 


ggcggcttgg 


cgcccggcgc 


tcccaccgtc 


acctcggaca 


ggccggccag 


cggcatgagg 


900 


tacrcccrccrcc 


acrccrcfcccacr 


c a cr c t a or c cr c 


crtcrccrcaccft 


ccgcgctctg 


caccaccgcc 


960 


ttcaccttgg 


tggcgggcga 


caggttgctc 


tcgccgcgga 


tggtgcgcag 


gccctcgatg 


1020 


gcggcgatga 


ccggcgccat 


ctcgccttcc 


gccgcctcgt 


ccacgtgcgc 


cgcgtccggc 


1080 


tccgggtacg 


gggcaatcat 


gatgctgtcc 


gtcggccggg 


ccatcggcag 


cttctgccag 


1140 


atctcctcgg 


tgatgaacgg 


catgaacggg 


tgcagcagcc 


gcaggatgcg gtccaggcag 


1200 


tacacgagca 


cccggcgcgt 


ggagtccttc 


gccgccgcgt 


cctcgccgta 


cagcgagccc 


1260 


ttcgccagct 


cgatgtacca 


gtcgcagaac 


tccgcccaga 


ggaactggta 


gagcgtggag 


1320 


gcggcctcgc 


cgaagctgta 


cgactccagc 


agcgcgcgcg 


cctccgtggt 


ggcgcgctgg 


1380 


aggcgggaga 


gaatccagcg 


gtccgccagc 


gtcagtgtgc 


tccggtccag 


cgggccggcc 


1440 


tccatccgga 


agtcgcccat 


gttcatcagg 


gcgaagcggc 


tggcgttcca 


cagcttgttg 


1500 
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ccgaaggcct 


tgtagccctc 


gagccggtcc 


atcgacagct 


tgatgtcgcg 


gccctgctgg 


1560 


gtgagcgagg 


cgagcgtgaa 


gcgcagcgcg 


tccgcgccga 


aeggeggcat 


gccctgcggg 


1620 


aaccggttct 


teagegtegg 


ctggagcttg 


teggcegggg 


cgccgaggac 


gatgtccagg 


1680 


gggtcgatga 


cgttcccctt 


cgtcttggac 


atcttctggc 


ccttctcgtc 


gcgcaccatc 


1740 


gcgtgcaggt 


acacggtgcg 


gaagggcaca 


tcccccatga 


agtggatgcc 


catcatcatc 


1800 


atccgggcga 


cccagaagaa 


gatgatgtcg 


tggcccgtct 


ecatgaegga 


egtegggtag 


1860 


aaggtcttca 


gctccggcgt 


ctcacgcggc 


cagcccagcg 


tggagaaegg 


ccacaggccg 


1920 


gacgagaacc 


aggtgtccag 


cacgtccggg 


tcctggatga 


aggacgcgcc 


gccgcacttc 


1980 


gggcaggact 


tcggctgctc 


gcgcgcgacg 


atgggctccg 


cgcgcgcgaa 


gtccaccccg 


2040 


cccaccttca 


ccgtgggggc 


gtccagcggc 


aggtccgtgt 


cgtcgccctg 


ccgcgggctg 


2100 


cacgaggtgc 


agtagtaege 


gggaatctgg 


tggccccacc 


acagctggcg 


gctgacgcac 


2160 


cagtcgtgga 


tgttgcgcat 


ccagtggaag 


aacgtgttcg 


tccaggactc 


ggggacgaac 


2220 


ttcgtgcggc 


cctgctccac 


cgcctcgatg 


gccggcttcg 


ccagcggttc 


aatcttgatg 


2280 


aaccactgcg 


gagacaggcg 


tggctccacc 


accgtggcgc 


agegctggea 


ggtgcccacg 


2340 


ttcagcatgt 


ggggctcttc 


cttctccagc 


agcccctgct 


ccgtgaggtc 


cgccagcacc 


2400 


tgcttgcgcg 


cctcgaagcg 


atccatgccg 


gcgtacttgc 


cggtctcctt 


cgtcatccgg 


2460 


gccgcttcgt 


ccaggatggt 


gagcatgggc 


agcttgtgcc 


gcaggcccgt 


ctggtagtcg 


2520 


ttgaagtcgt 


gcgccggcgt 


caccttcacc 


acgccggtgc 


egaacttegg 


gtccaccagc 


2580 


tccgcgtccg 


cgatgatggg 


aatctcgegg 


teggtcageg 


gcagcttcac 


gctcttgccg 


2640 


gccagcccca 


ggtagegetc 


gtcctccggg 


tggatggcca 


ccgcggtgtc 


gcccagcatc 


2700 


gtctccgggc 


gtgtggtggc 


caeggtgage 


gtgeggtege 


tgtccttgac 


ggggtagegg 


2760 


atgtgccaga 


tggagecett 


cttcgactcc 


tcgtgctcca 


cctccaagtc 


getgagggeg 


2820 


gtgcggcacg 


aggggcacca 


gttgatgagc 


ttctgggccc 


ggtacatcag 


gccctcttcg 


2880 


tacaggcgca 


cgaagacctc 


gcgcacggcg 


geggaggact 


gctcgtccat 


ggtgaagcgc 


2940 


tcgcggctcc 


agtccagega 


ggcgcccagg 


aagcggtgct 


gctcgccgat 


gcgggcgccg 


3000 


tacttgccct 


tccactccca 


gacgcgctcc 


aggaaggegg 


cgcggccgag 


gtcgtgccgg 


3060 


ctcttgccct 


eggtcttett 


cagctccttc 


tccaccacca 


tctgcgtggc 


gatgeeggeg 


3120 


tggtccgtgc 


eggggageca 


cagggtgttg 


aagccgctca 


tccgcttcca 


gegegegagg 


3180 


atgtcct,gga 


tggtggcggt 


gagcgcgtgg 


ccgatgtgga 


ggctgcccgt 


cacgttgggc 


3240 


ggcggcagca 


cgatggagaa 


ggcgggcttg 


teggaggteg 


cctcggcgcg 


gaagtagttc 


3300 


cere cc cat cc 


ageaggegta 


ccagcgggcc 


tcgacctcgg 


tgggctcgta 


ggccttggac 


3360 
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agttcagtgg 


tgtcggtcat 


tgcggacggc 


cagccccggg 


ggaaccgggg 


ccggcgggaa 


3420 


ggtcttggag 


agaggttaag 


gcgaggaacg 


cgtcagtgct 


gcgtctcccg 


gtccttgatg 


3480 


agccgctcga 


gctcctcacg 


gatgatggtc 


tccgccagct 


gcggtaccac 


ctcccaggca 


3540 


atcttctcga 


tgacctcgcg 


ggaggccttc 


gagagcgcct 


cgcgcagcag 


cgcctcgcca 


3600 


ccatccgcgg 


cggtcggacg 


ggccgtggcc 


ggagcggcgg 


tcggcgccac 


cgcggtcgga 


3660 


gcggccgtcg 


gggggccgcc 


catgtccagg 


gagatctcct 


cggcaccgct 


gggctcgggc 


3720 


agcgagtcct 


cgatgctgat 


ggaaggctgg 


gctgcctgcg 


gcgccgccgt 


gggagcgccc 


3780 


aggccgaacg 


ggtccctcgc 


gcgggccgcc 


ggctgggtcg 


cgccacccgg 


cgcgggaggc 


3840 


agcggcgcgg 


cgccgcccgg 


aggacgcggg 


aagcccggct 


ggctgccctg 


gggaatcccc 


3900 


ggacgggcca 


tgccggcgcc 


ggcccacccg 


gaggcgcggg 


cacgcccgga 


cgcggcggca 


3960 


tgcccggcgc 


ggacgcaccc 


ggcggaggcg 


gcacgcccgg 


gacgcgcggc 


ggcgcccgga 


4020 


ggcggcatgc 


ccgggcccgg 


aggacgcggc 


gcgcccggag 


cacccggcgg 


cggcattcca 


4080 


ggccccggcg 


ggcgggccat 


gcccggtccc 


ggcggcattc 


caggccccgg 


aggacgcggc 


4140 


gcgcccggag 


cacccggagg 


cggcattcca 


ggcccgggag 


gacgggccat 


gcccggcccc 


4200 


ggcggcattc 


caggccccgg 


aggacgcggc 


gcgcccggag 


cacccggcgg 


catgcccggg 


4260 


cccggaggac 


gcggcacgcc 


cggctgggtg 


ccctgcggaa 


tgcccggacg 


ggcggccgtg 


4320 


ggcgcgcccg 


gctgcgggcc 


cggaggcacc 


ggcgcgggag 


tggccggctg 


ggcggtgggc 


4380 


gggcgcacct 


gcgtggccgc 


ggaggccggc 


atggtgttgg 


acttctgccc 


caccagcgcc 


4440 


ttcaccttgt 


cgagcagcac 


ctggctctcg 


aagggcttgg 


cgatgtggtc 


atcggcgcgg 


4500 


gcggcgcggg 


cgcggttctc 


gtcgaaggcc 


tcgaaggtcc 


ccgccagcag 


caccacgggg 


4560 


atgccctggg 


tggccgggtc 


gctcttcagg 


gcctcgcaga 


cctcgtagcc 


gctcttgccc 


4620 


ggcatcatca 


catcggcgag 


gacgacgtcc 


gggcgcagct 


cgcgggtacg 


ggagatggcg 


4680 


tccagcccgt 


tgtccaccgc 


ggtcacctga 


aagtcctcgg 


tcgcgaagat 


catcccaatc 


4740 


accttgcgga 


tggtgagcga 


gtcatcggcg 


accagcagat 


tcttgggcat 


cggttcgggc 


4800 


ctcggggggc 


gtcccacccc 


cctgaattcg 


ttggagattc 


ctgccgcgaa 


aaacccaggt 


4860 


cgcggccctg 


tatcaagcgg 


ggcagcttac 


gtttcgcgct 


ccagcacgat 


caagaaaaca 


4920 


tgcgctgcag 


gtccaggaac 


agcacggagc 


ctgtcaaccc 


gggggcacgg 


aaggctccaa 


4980 


tggtggtctc 


cggctcgaag 


tgctgcagca 


cgccgagcac 


ccgggtggcc 


gtcagcccca 


5040 


cgttccggcc 


ggccagctcc 


gtgaggatga 


agagggcctc 


gggtaccggg 


ggggagccca 


5100 


gcagcgcggg 


gaccgagcag 


atgggccaca 


gcacctgggc 


gtgcgggaag 


atgccggcca 


5160 


ccggcccgct 


ctgcacgggc 


aggacgctga 


aggcctcccc 


tcgcgatatc 


acctgggaga 


5220 


cgaagcccag 


cgggacgccg 


aacaggcgcc 


cctgcgattc 


gaagacgagc 


gccttgtcgg 


5280 
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gcgtggactc ggcccagtgg acggggcgca gggccgcggg cgcggcacga gggcccaccc 5340 

ggtggggcag cgcctcggcg atgagctcca ggtagagccg ctccttgtgc agcaccgcgc 5400 

cccggctcag gggggccagc gagtccgcca gcccgggggg caggaggaag aacggcgccc 5460 

gcgccacgtc ggcgacctcc accacggagc gcacccgcac ggccagggtg ggactcacgt 552 0 

cgagcaccac caccatgccc ggcccctcct cgggcaagcc gccgagcagg gccgacaggt 5580 

ccttcacctc cagcacgcca cgcaggctgg agccgttggc gccgggcatg gccacttcca 5640 

tgacggaggt ggcctccacc gcgtagcgcg tctctcctgc ctccacgaga aggcagagcc 5700 

ggcgtccgct ttcgaagg gtg aca egg ccc tac cct age age ctt cct cgc 5751 

Val Thr Arg Pro Tyr Pro Ser Ser Leu Pro Arg 
1 5 10 

ctt ggt ggt aag cca teg age cag atg gcg cgc ate etc etc gtc gac 5799 
Leu Gly Gly Lys Pro Ser Ser Gin Met Ala Arg He Leu Leu Val Asp 
15 20 25 

gac gaa aag ate gcc cgc acc etc tac ggc gac tac etc acg gcc gtg 5847 
Asp Glu Lys He Ala Arg Thr Leu Tyr Gly Asp Tyr Leu Thr Ala Val 
30 35 40 

ggg cac gcc gtc acg gcg gtg ggc teg etc cag gac gcg egg gag gcg 5895 
Gly His Ala Val Thr Ala Val Gly Ser Leu Gin Asp Ala Arg Glu Ala 
45 5 0 55 

ctg gcc ggg gac cgc ttc gac gcg gtg gtg acg gac etc ate etc ccc 5943 
Leu Ala Gly Asp Arg Phe Asp Ala Val Val Thr Asp Leu He Leu Pro 
60 65 70 75 

ggc ggc gac ggc atg gag gtg ctg cgc cac gtg cgc gag cgc cac ccc 5991 
Gly Gly Asp Gly Met Glu Val Leu Arg His Val Arg Glu Arg His Pro 
80 85 90 

ggc gtg gag gtg gtg gtc ate acc ggc ctg gag aag gtg gac ccc gcc 6039 
Gly Val Glu Val Val Val He Thr Gly Leu Glu Lys Val Asp Pro Ala 
95 100 105 

gtg cgc gcc ate aag age ggc gcg gcc gag tac etc gtc aag ccg gtg 6087 
Val Arg Ala He Lys Ser Gly Ala Ala Glu Tyr Leu Val Lys Pro Val 
HO 115 120 

gcc ccc gag gcc etc cag cat gcc gtc cgc cgt gcc etc acc acg cga 6135 
Ala Pro Glu Ala Leu Gin His Ala Val Arg Arg Ala Leu Thr Thr Arg 
125 130 135 

gac etc etc cag gag aac gcc teg ctg cgc egg cac gtc gcc atg ctg 6183 
Asp Leu Leu Gin Glu Asn Ala Ser Leu Arg Arg His Val Ala Met Leu 
140 145 150 155 

gag gcc ggc cag cgc ate gcc acc acg ctg gag cgc gag aag ctg gcc 6231 
Glu Ala Gly Gin Arg He Ala Thr Thr Leu Glu Arg Glu Lys Leu Ala 
160 165 170 



tec gcc acc acg age gcg ctg gag tec atg gcc tgc gcc age gcc gtg 
Ser Ala Thr Thr Ser Ala Leu Glu Ser Met Ala Cys Ala Ser Ala Val 
175 180 185 



6279 
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gtg ctg ctg gag cgc gac ggc gac aag ggc ctg egg etc cag ggc gtc 6327 
Val Leu Leu Glu Arg Asp Gly Asp Lys Gly Leu Arg Leu Gin Gly Val 
190 195 200 

cgc gga ctg cct ccg egg cac gag ccg gag ctg gtg gec ace etc gtc 637 5 

Arg Gly Leu Pro Pro Arg His Glu Pro Glu Leu Val Ala Thr Leu Val 
205 210 215 

gag cgc etc teg aac gcg cgc gcg ccc egg gag ctg gaa ggg ctg gac 6423 
Glu Arg Leu Ser Asn Ala Arg Ala Pro Arg Glu Leu Glu Gly Leu Asp 
220 225 230 235 

gee gec ttc ccg cgc gtc etc tec ate ccc gee gtg gag ggc acc acg 6471 
Ala Ala Phe Pro Arg Val Leu Ser lie Pro Ala Val Glu Gly Thr Thr 
240 245 250 

gtg ctg ggc cac gcg gtg etc ttc ttc gac ggc gee gag ccg gag tgg 6519 
Val Leu Gly His Ala Val Leu Phe Phe Asp Gly Ala Glu Pro Glu Trp 
255 260 265 

acc ggc gag acg gcg ggc tac ctg gtg cgc aac tgg gca ctg gcg ctg 6567 
Thr Gly Glu Thr Ala Gly Tyr Leu Val Arg Asn Trp Ala Leu Ala Leu 
270 275 280 

cgc aac etc ggg cgc ttc gcg gcg gtg gag gac ctg gcg tac gtc gac 6615 
Arg Asn Leu Gly Arg Phe Ala Ala Val Glu Asp Leu Ala Tyr Val Asp 
285 290 295 

gac etc acc cgc etc ttc aac acg cgc tac ctg cac ctg gtg ctg gac 6663 
Asp Leu Thr Arg Leu Phe Asn Thr Arg Tyr Leu His Leu Val Leu Asp 
300 305 310 315 

cgc gag gtc cag gac tec gtc cag acg cag cgc ccc ttc age ctg ctg 6711 
Arg Glu Val Gin Asp Ser Val Gin Thr Gin Arg Pro Phe Ser Leu Leu 
320 325 330 

ttc ctg gac ctg gac cac ttc aag tec ate aac gac acg cac ggc cac 6759 
Phe Leu Asp Leu Asp His Phe Lys Ser He Asn Asp Thr His Gly His 
335 340 345 

etc gtg ggc teg egg ctg ttg gtg gag acg gec egg gtg gtg aag ggc 6807 
Leu Val Gly Ser Arg Leu Leu Val Glu Thr Ala Arg Val Val Lys Gly 
350 355 360 

tgc gtg aga gac cac gac gtg gtg gec cgc tac ggc ggc gac gag tac 6855 
Cys Val Arg Asp His Asp Val Val Ala Arg Tyr Gly Gly Asp Glu Tyr 
365 370 375 

9"tg gtg etc ctg cgc aac acc gac tec ggc ggc gcg etc aag gtg gcg 6903 
Val Val Leu Leu Arg Asn Thr Asp Ser Gly Gly Ala Leu Lys Val Ala 
380 385 390 395 

gag cgc ate cgc cgc acc atg gag acg cac ctg ttc etc gcg cgc gag 6951 
Glu Arg He Arg Arg Thr Met Glu Thr His Leu Phe Leu Ala Arg Glu 
400 405 410 

ggg ctg teg ctg aag etc age acg tgc ate ggc gtg gec age ttc ccc 6999 
Gly Leu Ser Leu Lys Leu Ser Thr Cys He Gly Val Ala Ser Phe Pro 
415 420 425 



gag cac gee egg gac aag gec acg ctg ctg gac ctg gcg gac egg gec 
Glu His Ala Arg Asp Lys Ala Thr Leu Leu Asp Leu Ala Asp Arg Ala 
430 435 440 



7047 
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atg tac egg ggc aag cgt ggc tec egg aat gtc gtc tac atg gcg gec 
Met Tyr Arg Gly Lys Arg Gly Ser Arg Asn Val Val Tyr Met Ala Ala 
445 450 455 



7095 



ctg gat ctg gag ccg ctg ccg gcg gag egg cgc cag ggc ggc age ggc 
Leu Asp Leu Glu Pro Leu Pro Ala Glu Arg Arg Gin Gly Gly Ser Gly 
460 465 470 475 



7143 



ccc ggc ggc tec age acg tag cggctcagcg gcgcaggttg cccacggtga 
Pro Gly Gly Ser Ser Thr 
480 



7194 



ggcgcttggt 


cggggtggtc 


tccacctcgg 


cgcgtgctcc 


cccgccctcc 


ttgcccaccg 


7254 


cctccacctc 


ctcgtgcggg 


gcaccggtgg 


tgatgagccg 


gtacttctcc 


gaggcctcgc 


7314 


ggtcatgctg 


ggcctgctcg 


gggtccaacc 


gggcaatcag 


ctgccagaag 


agggeggegt 


7374 


gctcgcgctc 


ctcgtccatg 


acgtgacgga 


ggatggcctt 


ggcctcctcg 


ttgtcggtgg 


7434 


catcgatgtg 


ggeggegtag 


aggttgatgg 


cgtccagctc 


cgcctcgatg 


ttcagccgga 


7494 


tggageggge 


cagcteggag 


teggtcatet 


tccggggcac 


cagcgagtgg 


aacgggttgg 


7554 


tetgeggcat 


gaaggccctc 


ccgaggccgg 


tggtggcgtc 


cacgttctgg 


acggggcgcc 


7614 


aggggacgeg 


aggagcatcc 


gcgcggccag 


gggcccgggt 


caaeggaate 


cctccgcccg 


7674 


cgttgcgtag 


ataccgctgc 


ctccgccatg 


ctcgggccat 


ggaaacaccc 


gcgccgctct 


7734 


cccagttgct 


ccaggccctg 


gaggegggag 


atctgccggc 


cgcgcgcgcg 


geggeggegg 


7794 


ctctgcaacg 


tgcgggcgcg 


cactcggcac 


age t ggc ggc 


ggaggtgctg 


cacgagctgc 


7854 


ggcagcccct 


gctgggcgtg 


aaggcctacg 


cgcagctgct 


cgeggaggae 


ggtggcccca 


7914 


ccggcccgct 


gcggctgctg 


ctggcgcagg 


tggageggat 


ggaacagatc 


gtctccgact 


7974 


acatccgcct 


cgccagcgag 


cggcacgcgc 


cgcagcagcg 


cctgtcgctg 


gcggcgccca 


8034 


tetgggegge 


ggcgaagctg 


ttcagegtea 


acccggactc 


ggcgcgcatg 


tctctggagg 


8094 


tggaggegee 


cgaggacatc 


accatccagg 


gcaacgcgcg 


gcccaucgag 


cagctcacgc 


O JLD *t 
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gaacaacgcg 


cgcgacgcca 


tggccggccg 


egggegegtg 


aaggtggtgc 


8214 


tctcgcgcga 


gggcgcctct 


ccggtgctgt 


acgtggcgga 


ctggggcccg 


ggcatccccg 


8274 


tggagctgcg 


cgagcgcatc 


ttcgagccct 


acgtcaccgc 


caacaagcgc 


ggcaegggae 


8334 


- ~~cj oc t crcrc 


ggtgtgcaag 


cgcatcgccc 


aggagcacaa 


cgegacgatt 


ggcctcgcgg 


8394 


cgccgggcgc 


cattcgcgac 


gtgcccccgc 


cggccaccgt 


gttccgggta 


ctcttccccg 


8454 


ccacggacgc 


gccgccgcag 


ctgcgcaagc 


ggctgctggt 


ggtggacgac 


gagaccatca 


8514 


teegcatrggt 


cttccgcgac 


ctgatgggca 


aggagtgega 


ggtcatcgag 


gcggccagcg 


8574 


- ~ g c'a' y ——a C 


gctggatttg 


ctgcgccagg 


cgccagtgga 


cctcatcgtc 


aeggacaaga 


8634 


= t- ty- n nna 


cctgtccggg 


ctggagctgg 


cgcagcaggc 


gcggcggctc 


tactccaact 


8694 
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cgcgcgtcat 


cctgatgacg 


ggctacccgt 


cgctggtgac 


gacgcagcag 


gcgctggagc 


8754 


tgggcgtggt 


ggactacctg 


ctcaagccct 


tcgacgacat 


ccggcaggtg 


cgcgggctgt 


8814 


tgcgcacgac 


gctgtcctcg 


gagccgccgg 


tgcctccggt 


ggtggccccc 


ggggccgaag 


8874 


tgcggcgggt 


ggacgtgctg 


gaggacaacc 


cgacgacggc 


gcggctcatc 


tccgaggccc 


8934 


tggacatgct 


ggggctggag 


gcgcgggtgc 


tgccctcgac 


ggagctgatg 


gcgatggcga 


8994 


cgccggtggg 


cgtggtggtg 


agctgggact 


tcgcgccggc 


ctatgggcgc 


aaggcgctgg 


9054 


agctggccaa 


ggcgctggcc 


cagggcgccc 


ccttcgtcgt 


gctggccgag 


cacctcacca 


9114 


tggagacggc 


gctggagtcg 


ctgcgcgccg 


gtgccgccgc 


gtgccttccg 


aagctgctgt 


9174 


ccgataccac 


ggcgctcagc 


cgtgagctga 


gccgcgcctt 


caagagagag 


gtcccatgag 


9234 


actcgcgctc 


ctgtccgtcc 


tgctcttcgc 


ggtggcctgt 


tcctcggaca 


agcccccgcc 


9294 


gacggagcgg 


ccggacgggg 


gcgcgacgga 


ggactccggg 


acgggcggag 


aggacgcggg 


9354 


cccgacggag 


gacggaggca 


cggagccgga 


tggtggctcc 


ggctccgatg 


cgggcacgga 


9414 


tgcgggagcg 


ggcgacgcgg 


gcacggtgga 


tggcggaagc 


tgtgaggcgc 


ccgcggccga 


9474 


gcccgatacg 


gagctggcca 


cgcgactgaa 


cacgccccgc 


cggctcgcgg 


tggacacgac 


9534 


ggacatctap 


atctccgagt 


cacactccct 


gaacccccag 


cagccgagtc 


cgggaccggg 


9594 


acaggttctg 


aggctgcccc 


gcgcgggagg 


tccacccacg 


gttctggcca 


cgggcttccg 


9654 


cgcaccagac 


gccatcgccg 


tggatgggac 


gagcgtctac 


gtgctcgacc 


tggacgggct 


9714 


ctggcgggtg 


gacaaggcaa 


ccgggaagcg 


gggcgacctg 


cccatcgacg 


cgacggatcg 


9774 


caacgtcacc 


gtcggtggta 


ccgaggtgct 


ccgcgccacg 


ctcgccggac 


gcgacgtgct 


9834 


ggtggtcgct 


acggggcggc 


agtcgctggt 


ccgggtggac 


accgatggcg 


gcaactccca 


9894 


ggttctctac 


accggtcccg 


gcggctccct 


ggtgcgtggc 


gcgcgagtgg 


tggactcgga 


9954 


cgtgtggttc 


ctcgtcagcg 


cggggacggg 


agccggaaac 


agctcgggtc 


tgtaccgcgt 


10014 


cccgctggac 


ggcagcgcgc 


ccgcggagcg 


gcgggacgcc 


accatcctcc 


aaggcaattc 


10074 


actggaggtg 


acccccacgc 


acttcctcgt 


caccgagggc 


ggtgggggta 


ccggccgggt 


10134 


gttgcggctg 


ccccgcgctg 


gaggcaccgc 


cgaggtgctg 


gccgacggtc 


tccagggccc 


10194 


gtggttcccg 


gtggaactga 


atgggaccat 


ctacttcaag 


gagtcacgtg 


ccgacggcgc 


10254 


ggacttcctg 


cgccgcgtgc 


gcacgtgcgc 


gctggggacg 


tcggatccgg 


tggggccccc 


10314 


gggcacggga 


ccaggcgggc 


tcatcgtgga 


cggcagcacc 


ctcctctaca 


cgtcgcagga 


10374 


gagcggcacc 


ggcggcgccg 


tgggccgcgt 


gccctgacgc 


cctcctccac 


ccccgaaaag 


10434 


accacggccc 


cggcgcctcg 


cgaggaggca 


tccggggccg 


cgtcctgcat 


cagtggaaca 


10494 


cctcaccggc 


tcggcgggcc 


cgaggcctca 


tcggccccgg 


accgccgtga 


gcggctcagt 


10554 


agtccatgtc 


gtcgccgccg 


tagtccggca 


tcccggcgcc 


gccgccgttc 


ttgcccttct 


10614 
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tcttggcggg gcggtcggcg atcatcgcct 


cygcggncag 


cagcagcgag 


gccacggacg 


cggcgttctg 


cagcgcggtg 


cgctcgacct 


n_y t-cggycc 


ya.Lyd.Ly Luy 


yLLLUL LCLd 


ggtcctcgta 


gacctcggtg 


cgggcgttgt 


agccgaacgc 


yCCCLyyCCC 


tcgcggacct 


tgttgatgac 


caccgcgccc 


tcgacgccgg 


cgttggaggc 


gatcttccgc 


agcggctcct 


ggagcgcgcg 


gcggatgatc 


tccacgccga 


agucctigc tc 


accgcccagc 


tccagcttct 


ccagcgcggg 


caggcagcgc 


aggaaggcca 


cgccgccgcc 


agggacgatg 


ccctcctcga 


cggccgcgcg 


agtcgcatgc 


agcgcgtcct 


t~* r~* £5 r" 1 rxf nTrrTr 1 


CtLCLLCLCC 


llcqlclc gg 


tctcggtcgc 


ggcgcccacg 


ttgatgacgg 


LLacyccycc 


CaCCayCLLy 


gccagccgct 


cctggagctt 


ctcgcggtcg 


tagtcgctgg 


tgacggtgtc 


gatctgcgag 


eggatgaget 


tgatgcggcc 


ctcgatctcg 


ctcttggtgc 


cggcgccguc 


gacgatggtg 


gtgttgtcct 


tgtccaccgt 


gatgcgcttg 


gagcggccca 


ggccgccgag 


cgtcaggttc 


tegtacttgt 


ggcccagctc 


ctcgctgacg 


accatgccgc 


ccgucagggc 


ggcgatgtcc 


ttcagcatct 


ccttgcggcg 


gtcaccgaag 


cccggcgcct 


4^ a /^t y% ^"r ^"«r 

tCaCCyCyyC 


cacgttcagc 


acgccgcgga 


tcttgttgac 


caccagggtg 


gccagcgcct 


cgccc ccgat 


gtcgtcggcg 


atgatgagca 


gcggcttccc 


ggagcgcgcc 


acctgctcca 


ggatgggaat 


catgtcctgc 


atcgacgaga 


ccttcttctc 


gctgatgagg 


atgaaggggt 


/^r 4™ <— i 4** 

CytCCaCyac 


gacctccatg 


cgctcgcggt 


tcgtcacgaa 


gtacggagag 


acgtagccgc 


ggucgaac ng 


catgccctcc 


accacgtcga 


ggttggtctc 


caggcccttg 


gcctcctcga 


cyytyaLyaC 


gcccccc l eg 


cccaccttct 


ccatcgcgtc 


cgcgatgatg 


gagccgatgg 


4" ^3 4~ ✓""I ^5 yi 

tCCCaLCaCC 


gttggcggag 


atggtgccca 


cctgggtgat 


ggccttcttg 


tccgccgtgg 


4™ 4~ /T" /"*■* ^ y"« 

gcuuggacag 


cttcttcagc 


tcctccacga 


cgacctcgac 


ggccttgtcg 


atgccgcgct 


ugaggt-cca u 


ggggctgtgg 


ccggcggcca 


ccagcttgag 


gccctcctcg 


tagatggcac 


gcyocaycac 


cgtggcggtc 


gtggtgccgt 


cgccggcctt 


gtcggaggtc 


ttcgacgcga 




catctgcgcg 


cccatgttct 


cgaacttgtt 


gtcgaggtcg 


atctccttgg 


cgacggcgac 


gccgcccttg 


gtgatcgtgg 


gcgaaccaaa 


gctcttctcg 


atgaccacgt 


LyL.yyL.L-L. l l 


59"ggcccagg 


gtcaccgcca 


cggcatccgc 


gagggtccgg 


acaccgcgaa 


yydLyyLCuC 


acgcgcggac 


tggtggaaga 


aaatctcctt 


cgctgccatc 


gcaacccccc 


yadd.LLd.CCy 


/*f /vrr n 4— 4— 4— — % — « 

yyggatttaa 


ctgtgacgac 


gctggccgtt 


agcactcggc 


catggcgagc 


gccaacataa 


gggecagega 


aatgatgtca 


accaggaagg gcgtgcctgt 


gggcgagcgg 


acggggccag 


ggcccctcgg 


ggccgtctgg 


ccgggyy 


actcggcgaa 


gcgcacgagg 


ttgagcagct 


gacccatgtc 


caggggtttg 


w=l — ~, - ^ - 


gcgccacgcc 


cttgcgcagg 


ccccggtcgg 


tgacgccctc 


gtcctgggcg 



10674 
10734 
10794 
10854 
10914 
10974 
11034 
11094 
11154 
11214 
11274 
11334 
11394 
11454 
11514 
11574 
11634 
11694 
11754 
11814 
11874 
11934 
11994 
12054 
12114 
12174 
12234 
12294 
12354 
12414 
12474 
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cccgtcatca gcagcacggg aatggcctgg aagcgggggt cggccttgag catctccgtg 12534 

aagtggatgc cgtccaggtg gggcatgagg taatcggtga tgaccatgcc cacctggttg 12594 

cgctccagga tgtcgagcgc ctgcagggca tccgccgccg agtagaccgt gtagccatgc 12654 

atctccagga agcgagagag catggtgcac agctcgagat cgtcatcgac gacaaggatg 12714 

ttcacggggc gggccgtcac acatggacgg gcgccggaac ctaacagccg tgcccgtcgt 12774 

tgacaacgca cagcgtgcct tcatatgtgc cggcccatgg ggaagcggac cgcgaagcga 12834 
gagggg gtg ggt ggc gtg gag gct cgg cfc 

Val Gly Gly Val Glu Ala Arg Leu Asp 111 val 111 Asp Sa 
48 5 490 495 

Phe SS S* ri Q ! aC gCC gCg Ctg gcc ggg gcg ga S W ctg 12930 

Phe Glu Ala Gly Asp Phe Glu Ala Ala Leu Ala Gly Ala Glu Gly Leu 

500 505 510 

ctg gcg gac gcg ccc gag ctt ccc gag gcc etc cac ttc cg g gc t tea 12978 
Leu Ala Asp Ala Pro Glu Leu Pro Glu Ala Leu His Phe A?J Sa sir 
515 520 525 

r tg 5*? Sf a ttg Sgg ° gg Ctg gag ga 9 CTCC am aag gcc ttc ggg 1302 6 
Ala Leu Val Glu Leu Gly Arg Leu Glu Glu Ala Gly LyI Ala Phe S£ 
530 535 54Q 

cag gcc ctg aag gtg gcg ccg gag gac ctg gaa ate ctg ctg age gcc 13074 
Gin Ala Leu Lys Val Ala Pro Glu Asp Leu Glu He LeS LeS Ser Sa 

550 555 

gcg gac tgc ctg gtg tgc cgc gcc ggg gag gac cgg gag gcg gtg gcg 13122 
Ala Asp Cys Leu Val Cys Arg Ala Gly Glu Asp Arg Glu 111 ? a ? Sa 

565 570 575 

gag ggg etc gcc ctg tgc gcg cga ggc cgg cgc ctg gcc cag aag gcc 1317 0 
Glu Gly Leu Ala Leu Cys Ala Arg Gly Arg Arg LeS Sa Gin JJs Sa 
580 585 59Q 

gac gac gtg gag atg ctg tac gag ttc etc etc ctg gag ggc atg ggg 13218 
Asp Asp val Glu Met Leu Tyr Glu Phe Leu Leu Leu Glu lly Mec S£ 
595 600 60 5 

etc aac cag atg ggc gag tgc gcc acg gcg ctg gtg age ctg gac gcg 13266 
Leu Asn Gin Met Gly Glu Cys Ala Thr Ala Leu Val Ser LeS Sp III 
610 615 620 

gcg etc ggc cac atg ccg cgc teg ctg gac gcg cag gtc gag cgg ggc 13314 
Ala Leu Gly Hxa Met Pro Arg Ser Leu Asp Ala Gin Val Slu Arg Sly 
625 630 635 

ate gcg ctg ttc gag ctg tgc cgc ttc gac gag gcg aag gcc gct ttc 133 62 
lie Ala Leu Phe Glu Leu Cys Arg Phe Asp Glu Ala LyS Sa Sa Phe 
640 645 650 655 

r a n r tg T 3ag gaC 3Cg ° Cg gac gac ccg tgg gc <? ca ^ cac tac 13410 

Glu Lys Val Leu Lys Asp Ala Pro Asp Asp Pro Trp Ala His His Tyr 

660 665 670 

ctg ggg etc ate gcc gag cgg cgc ggg gac gag aag gag gcg aag cgg 13458 
Leu Gly Leu lie Ala Glu Arg Arg Gly Asp Glu Lys Glu Sa Lys £™ 
675 680 685 
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cgc ttc gac aag gcg egg gcg ctg gtg ccc gag gag ttc ccc ccg ccg 13506 

Arg Phe Asp Lys Ala Arg Ala Leu Val Pro Glu Glu Phe Pro Pro Pro 
690 695 700 

gtg gag ctg gec gag gcg gag ttc gac cgc gcg gtg gag gac gcg gtg 13554 
Val Glu Leu Ala Glu Ala Glu Phe Asp Arg Ala Val Glu Asp Ala Val 
705 710 715 

aag tec ctg ccc cgc cac gcg aag cag tac ctg gac aac gtc acc ate 13602 
Lys Ser Leu Pro Arg His Ala Lys Gin Tyr Leu Asp Asn Val Thr lie 
720 725 730 735 

gee gtg gag gac ctg ccc teg gac gag gat ttg ctg ggg cag gac ccg 13650 
Ala Val Glu Asp Leu Pro Ser Asp Glu Asp Leu Leu Gly Gin Asp Pro 
740 745 750 

ccg ctg tec ccg age ate etc ggg gtg ttc cgc ggc acg ccc gtg ggc 13698 
Pro Leu Ser Pro Ser lie Leu Gly Val Phe Arg Gly Thr Pro Val Gly 
755 760 765 

gag egg age gtg atg aac gcg tat gag etc ccc gcg tec ate gtg etc 13746 
Glu Arg Ser Val Met Asn Ala Tyr Glu Leu Pro Ala Ser He Val Leu 
770 775 780 

tac cag cgc aac ctg gag cgc ttc gcg agg acg cgc gag gaa etc ate 13794 
Tyr Gin Arg Asn Leu Glu Arg Phe Ala Arg Thr Arg Glu Glu Leu He 
785 790 795 

gag cag att gga ate acc gtg atg cac gag gtc ggt cac etc atg ggg 13842 
Glu Gin He Gly He Thr Val Met His Glu Val Gly His Leu Met Glv 
800 805 810 815 

ctg gat gag gac gac ctg tgg cag egg ggg ctg gac tga egagegagge 13891 
Leu Asp Glu Asp Asp Leu Trp Gin Arg Gly Leu Asp 
820 825 

ccgcgtggct tccagcgtct ggeggatgee ctegctgtag ggcgtcttgc gcacgctgcc 13951 

cagcagcttg tgcagcgccg agtegtccat gaggaeggge teggtgagea ggtagtgcat 14011 

ctcgaccatc tegegcatga acgggttgaa gaggecgatg aggeggagea tccccttccc 14071 

cgccgtcatg aacttcgega ggtggccggc ctgggcgtag atctcctcga egagcttgeg 14131 

ctgtgaggtg acgccggcgc eggegaggtt ccaccagcgg ccgtaggcac egggggegtc 14191 

catgagggee gtcacggtgg gacccacgtc gggcaegtag aegtactegt gcggcgggtc 14251 

gatggggccg atgagctggg cgcgcttgcc ctgggcggcg gcgacgaagg cggagtggag 14311 

gaagctcttg tegatgeegg ggccgtagaa gtcegggagg eggaggatgg tggegeggag 14371 

ctttccggcg gcgtccgcgg cgaggaggat gtcctcctgc tccttgcgca tgcggccctt 14431 

gaaggtgtgc ggctcgcggg ggtggtcctc ggtgacgggc gtggtgcggg ggcggccgta 14491 

egggtacacg gtgccgatga gcaccacgcg ctcgacgccc teggegatgg cggcgtccac 14551 

ggtgcggcgc atgagctccg ggtggagctg gaagcgccag tagtccacgc cgaccatgta 14611 

gatgagegtg cggatgcecc gggeggegge gcggatggtg tccggcgcgt eggggttcca 14671 
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ggtggcgatc tccgccttcg ggtccgcgcc gaactgcgac tgaagcgagg cgcgcgagcg 14731 

gccaaccacc cggtacgccc tcccctgctc gcgaagagcc ttcacgatgc tgtgtccgat 14791 

ggggcccgac gctccaaaca acgccacctt gtccgtatcc atggtccgct ccctgggttc 14851 

cgaacggcga gcgttttatg aacgtcgttc attgtgaacg gtgttcatct actgctggcc 14911 

gttcaggacg tcaagcccta cagtggggat atg ggg att teg gag egg aag gag 14965 

Met Gly lie Ser Glu Arg Lys Glu 
830 835 



cgc cag egg gcg gag ctg cgc gag cag ate etc cag gtg gcg aag gac 
Arg Gin Arg Ala Glu Leu Arg Glu Gin He Leu Gin Val Ala Lys Asp 
840 845 850 



15013 



atg gtg acg cga gag ggc ttt ggt gee etc teg atg cgc aag ctg gcg 
Met Val Thr Arg Glu Gly Phe Gly Ala Leu Ser Met Arg Lys Leu Ala 
855 860 865 



15061 



gac gcg gtg gag tac gcg ccg gcg acg etc tac ctg cac ttc gag aac 
Asp Ala Val Glu Tyr Ala Pro Ala Thr Leu Tyr Leu His Phe Glu Asn 
870 875 880 



15109 



egg gac gcg att gee egg gag ctg tgc ate egg ggg ttc cag gac ctg 
Arg Asp Ala He Ala Arg Glu Leu Cys He Arg Gly Phe Gin Asp Leu 
885 890 895 



15157 



ctg gag gcg ttc gag ccg gcg gca tec gtg gag gag ccg gtg gag egg 152 05 
Leu Glu Ala Phe Glu Pro Ala Ala Ser Val Glu Glu Pro Val Glu Arg 
900 905 910 915 

ctg tac egg ctg ggc gag gee tac gtg aag ttc ggg ctg gag cag ccg 15253 
Leu Tyr Arg Leu Gly Glu Ala Tyr Val Lys Phe Gly Leu Glu Gin Pro 
920 925 930 



gag acg tac egg etc ate ttc atg gag gac ccg aag ctg teg acg gcg 
Glu Thr Tyr Arg Leu He Phe Met Glu Asp Pro Lys Leu Ser Thr Ala 
935 940 945 



15301 



ctg ttc agg gac gcg ccg gag gac ggc gcg ggg ccg egg tec ttc ggg 15349 
Leu Phe Arg Asp Ala Pro Glu Asp Gly Ala Gly Pro Arg Ser Phe Gly 
950 955 960 

gtg ctg gtg aag gtg ttc gag gac ctg aag gcg gcg ggg agg att gec 15397 
Val Leu Val Lys Val Phe Glu Asp Leu Lys Ala Ala Gly Arg He Ala 
965 970 975 

gag gac gee gag ccc teg aag ctg gee gag gtg ctg tgg gec ggg gtg 15445 
Glu Asp Ala Glu Pro Ser Lys Leu Ala Glu Val Leu Trp Ala Gly Val 
980 985 990 995 

cac ggc ate gtg gcg ctg egg ctg acg tgt acg ggc ttc aag ggc 15490 
His Gly He Val Ala Leu Arg Leu Thr Cys Thr Gly Phe Lys Gly 
1000 1005 ~ 1010 



teg ccg ccg gag gag ctg gcg egg ctg ttg gtg tec acg atg gtg 
Ser Pro Pro Glu Glu Leu Ala Arg Leu Leu Val Ser Thr Met Val 
1015 1020 1025 



15535 



aac gga ttg ccg ggg ttg aag ccc gtg aag gcg gcg egg aag aca 
Asn Gly Leu Pro Gly Leu Lys Pro Val Lys Ala Ala Arg Lys Thr 
1030 1035 1040 



15580 
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egg tga geegggegtt ggttgacggg gcccgtgggg tgcggcagta ategaeggtg 
Arg 



15636 



gtagggcetc cggacctggt ggccggcccg gtcttcaaaa ccggtgaggg gegctgagag 15696 

gcgtccctgg tgagttcgat tctcatgctc taccgccatg ttttccttgg ggttttctga 15756 

cgcggccctg atcgggttgc aggacgcccc acccgcctgt acctctcgta cccaagtttt 15816 

acccaagtga ggggggtacg ggagccgcgc atg acg aag tgg age ggg aag tgg 15870 

Met Thr Lys Trp Ser Gly Lys Trp 
1045 



ate gga ggc cgc acg tac acg gec acc gac gga age acg cgc tgg 
lie Gly Gly Arg Thr Tyr Thr Ala Thr Asp Gly Ser Thr Arg Trp 
1050 1055 1060 



15915 



ate ate cgc aag acc gtc gec ggg gtc gcg tac aac gtc acg ctg 
He He Arg Lys Thr Val Ala Gly Val Ala Tyr Asn Val Thr Leu 
1065 1070 1075 



15960 



gac gtg egg age gag gee gag gee ctg get gag ctg gcg gee ttc 
Asp Val Arg Ser Glu Ala Glu Ala Leu Ala Glu Leu Ala Ala Phe 
1080 1085 1090 



16005 



cgc cgc aac ccg gee gcg tac egg acc gcg age cag gag cgc caa 
Arg Arg Asn Pro Ala Ala Tyr Arg Thr Ala Ser Gin Glu Arg Gin 
1095 1100 1105 



16050 



gac gag gee caa cag gec 
Asp Glu Ala Gin Gin Ala 
1110 1115 



cag gag gac gec gcg cag gcg gtg ttc 
Gin Glu Asp Ala Ala Gin Ala Val Phe 
1120 



16095 



ctg gac gag gat cgt gec egg egg ttc etc cag cat ctg aag tec 
Leu Asp Glu Asp Arg Ala Arg Arg Phe Leu Gin His Leu Lys Ser 
1125 1130 1135 



16140 



age ggc agg tec gac gag tac ctg aag age acg cgc age tac ctg 
Ser Gly Arg Ser Asp Glu Tyr Leu Lys Ser Thr Arg Ser Tyr Leu 
1140 1145 1150 



16185 



gee gcg tgg gca acg gcg etc gcg gac aag gac ctg cgc gcg gtg 16230 
Ala Ala Trp Ala Thr Ala Leu Ala Asp Lys Asp Leu Arg Ala Val 
1155 1160 1165 

acc gga ccg gcg ttg aag aag ate ctg gee acc tgg gac acc ggc 16275 
Thr Gly Pro Ala Leu Lys Lys He Leu Ala Thr Trp Asp Thr Gly 
1170 1175 1180 



aac aag tgg agg ate ate 
Lys Lys Trp Arg He He 



j. x O 



gtc ttc aag tec ttc acc teg ttc ctt 
Val Phe Lys Ser Phe Thr Ser Phe Leu 
1190 1195 



16320 



cag 
Gin 



Leu 



gac teg ggc gaa ctg gac ccg atg gtg aac ccc gga cgc ttc 163 65 

Asp Ser Gly Glu Leu Asp Pro Met Val Asn Pro Gly Arg Phe 
1205 1210 

gtc ccc cct gee cgc egg aac cac gaa etc aag ggc tac 16410 
Lys Val Pro Pro Ala Arg Arg Asn His Glu Leu Lys Gly Tyr 
1220 1225 
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age ate gag cac gtc cag cag etc tac gcg gee etc ggc teg cag 16455 
Ser He Glu His Val Gin Gin Leu Tyr Ala Ala Leu Gly Ser Gin 
1230 1235 1240 



gee ate cgc gat gtg ctg tgc ctt cag gee aag acc ggg atg cac 16500 

Ala He Arg Asp Val Leu Cys Leu Gin Ala Lys Thr Gly Met His 
1245 1250 1255 

ggc acc gag gtg gac egg etc gcg age ggt gac ggc aag ate gcg 16545 

Gly Thr Glu Val Asp Arg Leu Ala Ser Gly Asp Gly Lys He Ala 
1260 1265 1270 



gtg ctg aag gac cag ggc gag ate gee gee acc gtc acg ttc aag 16590 
Val Leu Lys Asp Gin Gly Glu lie Ala Ala Thr Val Thr Phe Lys 
1275 1280 1285 



cac aag acc ggg aag ccg ttc acc ttg teg etc gat gtg cag ggg 16635 
His Lys Thr Gly Lys Pro Phe Thr Leu Ser Leu Asp Val Gin Gly 
1290 1295 1300 



etc gee gee gec cag cgc ctt caa gec egg ggt tea gcg ccg gat 16680 
Leu Ala Ala Ala Gin Arg Leu Gin Ala Arg Gly Ser Ala Pro Asp 
1305 1310 1315 



egg cag acc ate cgc gag gec gtg gag cgc gtg tgc gcg cgc acg 16725 
Arg Gin Thr He Arg Glu Ala Val Glu Arg Val Cys Ala Arg Thr 
1320 1325 1330 



age ctg gag ttc gtc cac ttc ggc gcg ate egg cac age ttc gca 16770 
Ser Leu Glu Phe Val His Phe Gly Ala lie Arg His Ser Phe Ala 
1335 1340 1345 

acg tgg ctg gtc gag cag ggc age ate tac aac ccg aag ggc ggt 16815 
Thr Trp Leu Val Glu Gin Gly Ser He Tyr Asn Pro Lys Gly Gly 
1350 1355 1360 



ggc ctg tec ctg gaa gee gtc get cag gcg etc aac cac age agt 16860 

Gly Leu Ser Leu Glu Ala Val Ala Gin Ala Leu Asn His Ser Ser 
1365 1370 1375 

acg egg acc acg gee etc cac tac gtc age gee aca gtg ccg ccg 16905 

Thr Arg Thr Thr Ala Leu His Tyr Val Ser Ala Thr Val Pro Pro 
1380 1385 1390 

atg tac gtc gtc ccc ate cgc ctg gag cat teg aat gat cca teg 16950 

Met Tyr Val Val Pro He Arg Leu Glu His Ser Asn Asp Pro Ser 



1395 


1400 


1405 






gag ttg cgc ata get egg gca cct gcg ggg tga ggctcatgaa 
Glu Leu Arg He Ala Arg Ala Pro Ala Gly 
1410 1415 


16993 


ccagagagtt 


eggagatcag tacgaggagg 


ttcgttgatg 


cgtcaggccc 


caggceggag 


17053 


ccttcaacgt 


tettattgeg ccgaacgctc 


caggtgatta 


ccgcacgcct 


cccacaaagg 


17113 


agecattega 


tgeagatect ggccgtaacc 


gaacctccta 


gtttcggggg 


gaagtgcacc 


17173 


atcaccgacg 


tggcgggcaa cgttgtcggg 


tacctccacc 


tgcgcgtgtc 


ctcgatgctc 


17233 


aacggcagtg 


caggagtcgt aaccgatctc 


catggccgcg 


ccctcatgca 


ggcgcccttc 


17293 


aactttgggg 


tgtgggatct agaaacgega 


eggagggteg 


egatcgegtt 


cgtcgaggcg 


17353 
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gccttcccgg agactcgcgc cgcaattgcc ggagccaagc cgaccagagc ccccatcacg 17413 
gttgcggagt tcggaggcgt gctgcgctcg ctgctccctt ccacgcccac tcaccccgtc 17473 

gggactattc ccgcgtccat tccgatcgac gccatcaaca tcgttgcggc ggacctgtac 17533 
ccgaaccgga cctgcgatag ggagcacttg gaagagttga cgaaggtgct caacacggag 17593 

ttggaggcgc agtaccgcac gcgcctggag tggtcaggag acacgctgca cctgcgcgcc 17653 

aagtagacca tcggcccctc gggcgcggcc agcaaagtct cttccttgat cgaggagtag 17713 

gatgcggcga actacgcctc gatcactgcc ggatcattct tgtcacgatg tcggggggtt 17773 

agagcgtgtc gatccggctg cggatgccag gacaagaagt tctgcctcct ctatcccgtt 17833 

gggccatgtc gaggggcacc aaaaccctcg ctcgtgagct tcggccgctt cgttacggcg 17893 

actttcaacg aggctcaaat cgtgagcttg cggatggcaa cacctgggtt cggatacgct 17953 

ccggctgcgg ttccctcctg cttcccacgc cgctccttgg tgcggactcg ttcaggctgc 18013 

ttggtttgtt gaggccgatt gcctcccttc cgaggggcgc cctccgcgag cagtgcctcg 18073 

atgctcgcca ccgtgatcat cacggaccgt ccgatcttcg gtgcccgctc caaccggcct 18133 

tgctccagga gcttgaagac ctgtgaccgc ttgcagccga gcattgccga cgcgcgctcc 18193 

agcatcacag ccgaggcgtt ctgtgtgacc gattgagcgc gaagcgcacg aacctcgcgc 18253 

ttgatctcgg ccagttcctg gaggatggcg cattccacgg gtgcgggctc gtggtcctca 18313 

atgggcacgg caggtcgaaa gactggaatg ggaataggaa tgaggtccaa cggtggctcc 18373 

ggccgagcaa gcaaggagcc tgtacggata tagctgaagt tctcgatcat ggacttgtac 18433 

tcaaccggtc tccgtcggag gccgtggatg gtgtcgaagg tgtggatcga gccggtggcc 18493 

accagtccgt ccctattctg aatgtatcga atctctccga ggtccttccg ggcgctcgcc 18553 

ctgcccgcct ccgtgccggt gaaggcgaac gggaggcgct tggcccggca cctgagcccc 18613 

gttggacgcc atggatggcc cggaagggtc acagcaccgg ctacttcagc cctttactcg 18673 

tgctgcggcg tcatcgtcct gacttcccca tagccttcca tcccttccga tccttccacc 18733 

tctctctgca accaggggga gaccgaagca gacagcatgg aaggcatcgc cgcagtcgga 18793 

acgcatccgt tccaatcacg ggtgcgcaag gtccatccga gcgccagcgg cgatgctctc 18853 

catcaccccg tcgatctcgg tctcccagcg tttcaccgtg acccggtact tgattccatc 18913 

gctccgcttc cagcccgtgt tctcttcacc ggccaactcc ttcatgcgtt caccgaactt 18973 

cgtgagcgag aacgccatcc ggtcacggcc ctggtacttc aagtcctcga cgtagtggcc 19033 

gtacagegcc gtggccttga tccactggtc ccgctgcgcg caccggcgct cgaagaaatc 19093 

iifcccaccgag tccttggctc gaatctcatc ctggcgggcc tggacctgcg ccaattggcc 19153 

gaggsgrcggg ctctccgcct tctcgtccac gctcgtccac atggccagga agtcgatctg 19213 
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ctccagtgcc gtccagttca tctggggcag acagtccagt tgatagaagc gtcgcattcc 19273 

cgtgggatcg acgacgatct ccttcacctg cttgttggac gcaccgatga aggtggcgtt 19333 
gttggtgccg gtctggagct tctggacacc gagcacccgg tactcgatcc gctcggcggt 19393 
gatgcggttc ttgagcacgt cagacgctac ccgctctgcc ttcgccattt cgtcgaagaa 19453 
catgccgtag cagcgagtca accggaactg ctggcgctca tcggtcagga actggagatc 19513 
ggcggggaac ccaacgagat cgccaagagg ctgtagcagg ctcttgatgg ccgtagtctt 19573 
cccgccccgc tgtttgccga cgaagatcgg catcaggtgg tccttgaccg gcttcccgaa 19633 
caacttccgc ttcacctgcc aaatgaagtg aagaacgacc acgagatcta ccgggtcaag 19693 
gcgtcccgtg gttgctcgaa tgaactcctc ggcgtcacct cggcgtccgt cctcgcggaa 19753 
cttcaacttc tcacgatact gctcgacgat cccccggcgt tggtccgcct cccacaggcg 19813 
cacggcgtta ccgagatttc gcacgctgat gcatgtgcgc tcgttcgtgg actcgatctc 19873 
gatctgggac aagaactcgg acgcaggaaa catccggccc ccaatctgga agccatcgct 19933 
cgcgtacacg aatgtgacgc cgttcttctc catccagtgt tgaacccatg tgcgcggctc 19993 
gaagcggttc tcgaccctcg gcaacgcctg caccaatcgg tcgatgacct cgggcttgca 20053 
gaccttcttc ctcttcaagg tcccgaccca cttcttgaag atgccatgct cggtggagaa 20113 
ccaaaccgag caccagacga gagcgcgctt gatctcgatc tccttcaccc caccgtgcgc 20173 
ctcatgctgt tccacgagcc agtcacagag tgccgaagga agcgccgggc gcgcagaggc 20233 
caagtactgc acgatcagga actgacgagc ccatgcttcc gtcttgtggt gatgagcggc 20293 
aagcaacttc ttcatctcgc gatgcaccca atcatccaca ccggatttct cgtgctgtgg 20353 
gtcgtggtcc acgtacggaa tctgagggtc acatccgcgc gcagaaaggc ccttcgcgat 20413 
ccgcacaact tcacgaacga cgttcatgtt cgtgccgtcc atgtccggcg catcgccgca 2 0473 
gacgaacacc gtgcggccct gtgcaaggta tgatgcggcc tcggggtgcg gtacccactc 2 0533 
gtccgtcagg tcatcccagc ttcgtcgccg ggcctcaaca tcatgggcca tgctgatgcc 20593 
ggggagcccc agcgccgggg tgccttcctg cgcgagcagc gccgccttct tctcgccctc 20653 
aatgagcgcc atggggactt cttcgtcttg aagcatcggg cagaggatcg cggtgtcgaa 20713 
gatctggggc gcgtacccga aaggagcctc gtacttcttc accttcgctg gctccccttc 20773 
gacctcatcc tgggtgtacg ggcgcaggat gtggtccgtg aagg tc gcgg ttgggagggt 20833 
gagctgcgcc tccttcggcc ggagcacgcg cgggcgatgc ggcttgacct tggcgtagtt 20893 
gagccccggg tacacgagca ccagccctcc agcgtccccc ttccattcct cgggccagtt 20953 
caacaacttc cggttgttct cgtccgtgtc cgttcgccaa ccgagtttcc gcaccgtcga 21013 
gggcaggagg caggacttct tcaggtcctc gcagtctcgc gc cgagatcg ga agctcgtt 21073 
ccactcagcc tcgttgcggc tggattcaac ctcgttcgcc ttcgagtcgg gcagtgcctt 21133 
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gtccgagtcc tcatgtccca ttgggtacac ctctcccatc aacgccgaag ccgtgcttcg 21193 

acgccttgaa atgcagggtg gctcttgaac ttgtatcgat tctggctacg tcgttgcggc 21253 

gccctggagc ggagccgggt tcggtgtggc tacaaggaat atatcgaatg tgtctgcgca 21313 

cctcggacgg tgcgccccga gggccacggg catcaacgcc gctgcgcgat cctgcgccgt 21373 

ggacggccag gggattcgcg acgtacaccg gacggcaacc ttccgtgtat gagccgaggc 21433 

acccgctgga gcacggagta cctggacgcg atgctcgcgg cctgggagag gtccgggctg 21493 

acctcggagc agttcacggc cacggccgcg ccagggctgc ctgcgcgtac cctacggctg 21553 

caccgccagc gaaaagagaa ggagatccac gcagagcttc agcggcttcg ggatgaaaac 21613 

gcgacgctcc ggtcgatgct cgaaaggagc gcgttggtcc ctggtggttg ccgggcggca 21673 

accgtcagcg accccaacgg ccacggcatt ggagatcgcc atgtggcaaa cggaaccgaa 21733 

gccccttcgc cgaccgcgac cggcccgcca gcattggagc cggtacagcc gcgcgtggtg 21793 

ccgcaggacc aggttgccgg accctccccc ggtgatcacg atggtcgtgt gctcgtaggg 21853 

gaaatcaccg agccggtgcc cccgaagaga ccggcgttct cctgggattg atggcctcgc 21913 

gcctgctggc cgatgtcgga cgccatctac cgctggtctt ggtccgcgag cgccttcaag 21973 

agcttcttga tctcggccga ggacatcgtg cggatgcggt ccacgaggcg ttgccgctcg 22033 

tcatcctcgg gaaacgtcag gagatccagc ggcaacactc ccaggtggtc cgccagcacc 22093 

ttcaacgtgc gaaggccaag ctcctgcggc cagtgcttag aggtgggact cagggcacgg 22153 

agcagaggca gagcgcgagc accaggcagg cgctccgccg cgctcaccct gccgagccct 22213 

gtggtcgctt gtaataccct ttgggtttat gaatagtaga cctcggaccg cgagagccac 22273 

cggccccgtc cccctcagcc gaggccccac cagggagccg gaaggcccga gccagtgccc 22333 

gccgcgcgtc ccttcccacc tggaggccgc ac atg acc gat gac age acg ccg 22386 

Met Thr Asp Asp Ser Thr Pro 
1420 1425 

f a ° Hf S a ^ g Cag atc gag gcc ccg acs ? ccg ^g aac acc tec 22431 
Thr Asp Glu Thr Gin lie Glu Ala Pro Thr Pro Glu Asn Thr Ser 
1430 1435 144Q 

gac ctg aag ggc gcc cgc ccc cga etc egg ttc acc atc gag acc 22476 
Asp Leu Lys Gly Ala Arg Pro Arg Leu Arg Phe Thr lie Glu Thr 
1445 1450 1455 

^ g ta ° gag aCC Ctc cgc acg ctg gag egg egg ggt aac ggc 22521 
Thr Thr Tyr Glu Thr Leu Arg Thr Leu Glu Arg Arg Gly Asn Gly 
1460 1465 14 70 

gag gtg gtg etc ctg gcc gag cgt cac ctc ccc cat gga ctc gcc 22566 
Glu Val Val Leu Leu Ala Glu Arg His Leu Pro His Gly Leu Ala 
1475 1480 1485 

ggg etc gtc acc atc aag cgc ctg cgc aat ccc gtc act ttc gag 22611 
Gly Leu Val Thr lie Lys Arg Leu Arg Asn Pro Val Thr Phe Glu 
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a™ ^f g C9g ctg atc gag 3ag gtc cag etc tec ttc cgc etc 22656 

Arg Cys Gin Arg Leu He Glu Glu Val Gin Leu Ser Phe Arg Leu 
1505 1510 1515 

utl ^° « Cff 9 ? C gCC Cag gtc cat cac ctg aaa a tc cat gec 22701 

His His Pro Ala He Ala Gin Val His His Leu Lys He His Ala 
1520 1525 1530 

Sfac egg ccg cac gtc atc gec gag tac gtg gac ggc ccc acg eta 22746 
Asp Arg Pro His Val He Ala Glu Tyr Val Asp liy Pro Th? £eS 
1535 1540 1545 

gac ace atc atc age etc gec acc atg cgc gag aag ccg etc tec 22791 
Asp Thr He He Ser Leu Ala Thr Met Arg Glu Lys Pro Leu Ser 
1550 1555 156Q 

111 tit Hi f? C r CtC ** C a , tC gCC 3CC gag gtg gcc gac gcc etc 22836 
Ala Pro Phe Ala Leu Tyr He Ala Ala Glu Val Ala Asp Ala Leu 

1565 1570 1575 

His a?? ^ 9 ° tg ^ gat tCg gag aac cgg ccg ctg 22881 

His His Ala His Thr Leu Arg Asp Ser Glu Asn Arg Pro Leu Gly 

1580 1585 ~ 1590 

atc ate cac cgc gac gtg gcg ccc egg aac atc cgc gtg gcc cac 2?9?fi 
He He His Arg Asp Val Ala Pro Arg Asn He Arg Val 111 A?g 
1595 1600 " 1605 

S° gg ° g f g gtg aag gtg acg gac ttc ggc get acc tac teg etc 22971 
Ser Gly Glu Val Lys Val Thr Asp Phe Gly Ala Thr Tyr Ser Leu 
1610 1615 1620 

atg gtg ggc agg gag gag acg ccg ggc etc ctg etc aag ggc aac 
Met Val Gly Arg Glu Glu Thr Pro Gly Leu Leu Leu LyJ lly Asp 
1625 1630 163 5 

£ ac g f c tcg ccc gag tac ctg aac cgt aag ccc a tg gat 

Val Ala Tyr Ala Ser Pro Glu Tyr Leu Asn Arg Lys Pro Met Asp 
1640 1645 ~ 1650 

ggc egg tec gac atc ttc tcg ctg ggc etc gtc etc atg gag atg 
Gly Arg Ser Asp He Phe Ser Leu Gly Leu Val Leu Met Glu Me? 

165 5 1660 1665 

T ° tg a £ 9 * ffC aag CaC Ctc ttc gac gt S gag gac gaa aag gcc ccc 
Leu Thr cys Lys His Leu Phe Asp Val Glu Asp Glu Lys Ala Pro 

1670 1675 1680 

a 30 ?? C gt ? gaC gtg aag aCg gag gag ac 9 ccc tcg gtg ccc 23196 

Asn Ala Thr Val Asp Val Lys Thr Glu Glu Thr Pro Ser Val Pro 

1 68 5 1690 1695 

r tC ^f g at0 atC gCg ° tC gtc aac cgc tac cgc ccc gag gac 

Leu Thr Gin Met He Ala Leu Val Asn Arg Tyr Arg Pro Glu Asp 

1 7 °0 1705 1710 

ri 9 aa ° ?? g atg g f g gg ° Ctg ° Cg gac gcg etc aag gcc atc 
Val Glu Asn Ala Met Ala Gly Leu Pro Asp Ala Leu Lys Ala He 
1715 1720 1725 

gtc cac aag gcc ctc cag cga aag ctc tec gag cgc tac gcc acg 23331 



23016 



23061 



23106 



23151 



23241 



23286 
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Val His Lys Ala Leu Gin Arg Lys Leu Ser Glu Arg Tyr Ala Thr 
1730 1735 ^ " 1740 

gcc gcc gag atg cgc gat gcg ctg cga gcg gcg etc gcg gcg gag 23376 
Ala Ala Glu Met Arg Asp Ala Leu Arg Ala Ala Leu Ala Ala Glu 
1745 1750 1755 

tea cag ccc ttc ggc egg aag gag gcg age gag gag ctg gcg egg 23421 
Ser Gin Pro Phe Gly Arg Lys Glu Ala Ser Glu Glu Leu Ala Arg 
1760 1765 1770 

atg ctg teg gag gcg tec gtc ctg cgc gac egg gtg gaa ctg gac 23466 
Met Leu Ser Glu Ala Ser Val Leu Arg Asp Arg Val Glu Leu Asp 
1775 1780 1785 

gaa gaa ggc ate ttc ccc gag ggc ctg gac gcc gac gag ccg acg 23511 
Glu Glu Gly He Phe Pro Glu Gly Leu Asp Ala Asp Glu Pro Thr 
1790 1795 1800 

ccc gca ccg aac gaa gag tga gaaegggegg eggtgeatte cttgaagcaa 235 62 
Pro Ala Pro Asn Glu Glu 
1805 

gcgggccccg gtgtgggccc gcgtcgtttc agegcaacaa cacctgggcc accttcacga 23 622 

gggcatcgag ctggtcctcg tccatcttgc gcgccagtcc ggtgagctgg cgcagcttgg 23 682 

gcgccccgcc ctgccctcgc tcccccttgc ccgacttcgt cccctcctcc gcgtccgcga 23742 

tccccagcaa ctcgtccgag gagatgegea gcaccgagca catccgcagc agcgtctgga 23802 

cgctgggcag catcttcccg cgctccaggc ggctgtagac catgtgcgcg aggcccagct 23862 

tctcggccac ctccgcctgc gtgagecega gctgcgtccg ggcctcacgg gcggcacttc 23922 

caataegggt cgccagctct teattcatge gteggggtae caggacaccg agaaccaggt 23982 

ttggcgcggg caggaaggee caagcctcgg ggacgcccct tcctgccgca tctcggcggc 24042 

catgtcgatc cacgagcagg gccccgtcac gcggcgcccc tattggcegg acccatcggg 24102 

ggegggaace tccagggcca ggccgcccag gatgaagtcc aeggggacac ggaggacacc 24162 

gcacaggccc accagcgtgg ggaggctggg cagcaaccgg ccacgctcca ggcggctgaa 24222 

caccagcgtc ggcacgtgga tggcctcggc cacctgggcc tgegacagge ccgcccgatg 242 82 

ccgggcggcg eggacattgg ttccgatctg ctgggagagc ttcgegtaca tgaggagacc 24342 

tgggagaaag aggagaatgc cgcgtgggcg gcactccgag catacagctc accaggttca 24402 

i^aatagtat ccgctcaagg tttacccttc tcccgcgcca ettcgegggg ggaactcgta 24462 

cgagatccgc taccggttga gagcatgtcc aggaacgecg accccacacg ccccgccgaa 24522 

ggagaggaca cttgagcccg cgtctgcatc ccgtcgtccc cccgggcacg gaeateggeg 24582 



gag gag aag ctg gga gcc ggg ggc ttc ggt gcc gtg 24629 
Val Glu Glu Lys Leu Gly Ala Gly Gly Phe Gly Ala Val 
1810 1815 A 1820 

>z*:c cgc gee egg cgt gga ggg egg etc tac gcg etc aag etc ate 24674 
Tyr Airg Ala Arg Arg Gly Gly Arg Leu Tyr Ala Leu Lys Leu He 
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1835 



p™ £ 9 Z? 3 3 gCt gag tgg gcc gag C 9 C S aa 9tg gcc ate 24719 

Pro Leu Trp Gly Leu Ala Glu Trp Ala Glu Arg Glu Val Ala He 

1840 1845 1850 

ft? T Ct ° ^ ? tC aag CaC tCC aat ctg gt S c ^ c atc cgt ggg cat 24764 
Leu Leu Arg Leu Lys His Ser Asn Leu Val Arg He Arg Gly His 

1855 I860 1865 

rit nf 3 l" S° g gat gag gCg CCt caa tcc ttc ttc a tc gtc atg 24809 
Gly Gin Trp Pro Asp Glu Ala Pro Gin Ser Phe Phe He Val Met 

1870 1875 1880 

Sp Tv"r ?a? S£ S?5 S° ^ 9 r ttg ^ gta tgg gCC aga gag gag 24854 
^ Val Glu Gl y ^ ^9 Leu Asp Val Trp Ala Arg Glu Glu 

1885 1890 1895 

aac ccc teg g CC cga gaa gtc gtg etc aag gtg cgt ggc gtg gcg 24899 
Asn Pro Ser Ala Arg Glu Val Val Leu Lys Vai Arg Sy Si 52 
1900 1905 1910 

cgc ggg ctg ggc gcc gcg cac egg gcg aag gtg gtg cat cga gac 24944 
Arg Gly Leu Gly Ala Ala His Arg Ala Lys Val Val His A?g Asp 
191 5 1920 1925 

ctg aag gag age aac gtc atc gag cgc ace tcc gac gga gag gca 24989 
Leu Lys Glu Ser Asn Val He Glu Arg Thr Ser Asp G?y GlS SI 
1930 1935 1940 

S? fr t9 fr t9 ^ C gg ° gCg ggC ggg tac a S c 3CC ccc age 25 034 

Val Val Val Asp Phe Gly Ala Gly Gly Tyr Glu Ser Ala Pro Ser 

1945 1950 1955 

He X? HZ %ZZ S? T° tg T a aCg CCS gag taC cgc gcg 2 5079 

ue Tnr Gly G1 y Val Leu Pro Pro Gly Thr Pro Glu Tyr Arg Ala 

1960 1965 " 1970 

*™ g ?° Cg ° ttC Cag cag gag cac OTO gag cgt ggc 25124 

Pro Glu Ala Trp Arg Phe Gin Gin Glu His Gly Asp Glu Arg Gly 

1 9 75 1980 ' 1985 

H?s Ser Tyr Sin Pro rf S° C ? at ffaC Ctg ta ° tCg ctg ggc 25169 

his ber Tyr Gin Pro Gly Pro Ser Asp Asp Leu Tyr Ser Leu Gly 

1990 1995 2000 

tI C ^ g CtC Ctg aCg ggC agg ca g ccc ttc etc ccg 25214 

Val Val Leu Tyr Trp Leu Leu Thr Gly Arg Gin Pro Phe Leu PrS 

2005 2010 2015 

gac gag gcc gca ggg gtg gag gcc gtg etc aac cgc ace ccc aaa ?R9rq 
Asp Glu Ala Ala Gly Val Glu Ala Val Leu Asn 2g Sa Pro Ss 
2020 2025 2030 

Pro Pro Sis t tC aat CCa Cg ° gt f CCC gag gcc ctg a 3 fc 3 CC 25304 

Pro Pro His val Leu Asn Pro Arg Val Pro Glu Ala Leu Ser Ala 

2035 2040 2045 

fl Cvs ull f tg r tg ^ C gag CCC gag gag Cgc cac ccg 25349 

Val Cys Met Arg Leu Leu Ala Lys Glu Pro G lu Glu Arg His Pro 

2050 2055 2060 

gac gcc gac gcg ctg tgc gcg gag ctg gag gcc ctg ctg gcc cag 25394 
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Asp Ala Asp Ala Leu 
2065 

gcg gac gag tec tgg 
Ala Asp Glu Ser Trp 
2080 

gac acc gec acc acg 
Asp Thr Ala Thr Thr 
2095 

gag ctg gtg caa tgg 
Glu Leu Val Gin Trp 
2110 

ggg cca cgt ccg cca 
Gly Pro Arg Pro Pro 
2125 

aac gcg gtg gec ate 
Asn Ala Val Ala He 
2140 

cgc ccc ggc ccc atg 
Arg Pro Gly Pro Met 
2155 

ttc aag gec ctg cgt 
Phe Lys Ala Leu Arg 
2170 

ttg age gcg ggc gcg 
Leu Ser Ala Gly Ala 
2185 
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Cys Ala Glu Leu Glu 
2070 

gac gtg aaa etc tgc 
Asp Val Lys Leu Cys 
2085 

etc gcg gcg gtg ccg 
Leu Ala Ala Val Pro 
2100 

ctg aag aag cgc aag 
Leu Lys Lys Arg Lys 
2115 

cac ggc gag ggc gca 
His Gly Glu Gly Ala 
2130 

cag gca gag ctg cca 
Gin Ala Glu Leu Pro 
2145 

ccc teg ccg ccc ccg 
Pro Ser Pro Pro Pro 
2160 

gtg gcg acc tgg atg 
Val Ala Thr Trp Met 
2175 

ctg gtg ate gcg cgc 
Leu Val He Ala Arg 
2190 
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cct get cct gec gca 
Pro Ala Pro Ala Ala 
2200 

gee gtg gac tea cca 
Ala Val Asp Ser Pro 
2215 

tgg gee ccc ggc cag 
Trp Ala Pro Gly Gin 
2230 

get cag gaa gee gca 
Ala Gin Glu Ala Ala 
2245 

gec gtc gec etc ccc 
Ala Val Ala Leu Pro 
2260 

aag atg aag aag aac 
Lys Met Lys Lys Asn 
2275 



cgt cac gee ctg gag 
Arg His Ala Leu Glu 
2205 

cct gca tec ccc gag 
Pro Ala Ser Pro Glu 
2220 

gaa gtg gcg gcg cca 
Glu Val Ala Ala Pro 
2235 

gee gcg etc gat age 
Ala Ala Leu Asp Ser 
2250 

gcg acg ate tec gag 
Ala Thr He Ser Glu 
2265 

acc ggc gta ttc tec 
Thr Gly Val Phe Ser 
2280 



Ala Leu Leu Ala Gin 
2075 

gac gee tat ggc ccg 25439 
Asp Ala Tyr Gly Pro 
2090 

cac gcg ggc gaa gac 25484 
His Ala Gly Glu Asp 
2105 

gee egg cct cgt egg 25529 
Ala Arg Pro Arg Arg 
2120 

cac gag ccg gac tea 25574 
His Glu Pro Asp Ser 
2135 

ccc cca gtt cct get 25619 
Pro Pro Val Pro Ala 
2150 

gca caa gec ggt cac 25664 
Ala Gin Ala Gly His 
2165 

ggt ctg ctg gtg gtc 25709 
Gly Leu Leu Val Val 
2180 

acc ttg gcg cct tec 25754 
Thr Leu Ala Pro Ser 
2195 

cct cac ccc agt gat 25799 
Pro His Pro Ser Asp 
2210 

ttc gac etc gcg ccc 25844 
Phe Asp Leu Ala Pro 
2225 

tgg atg gca ccg gaa 25889 
Trp Met Ala Pro Glu 
2240 



gcg ccc acc eta gcg 25934 
Ala Pro Thr Leu Ala 
2255 

gag aag get tec gtg 25979 
Glu Lys Ala Ser Val 
2270 

gag ccc gag ccg cag 2 6024 

Glu Pro Glu Pro Gin 
2285 



26069 
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gcg ggc tgc eta gcg atg gec tgc cct ggc ccc cag gtc cgc ccg 
Ala Gly Cys Leu Ala Met Ala Cys Pro Gly Pro Gin Val Arg Pro 
2305 2310 2315 
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26114 



Th? I t Ca 9 f g gCS tgC ° Cg CCa ggfc gct gtc gag gec atg 

Thr Pro Pro Pro Glu Ala Cys Pro Pro Gly Ala Val Glu Ala Met 

2320 2325 2330 

age agg ctt cga att gag tat gac gag gat gtg gta gec ace ttc 
ser Arg Leu Arg He Glu Tyr Asp Glu Asp Val Val Ala Thr Phe 
2335 2340 2345 

oJf f? g f£ C g f° gag gag ggg gtt agc agg aac afc c ac 9 3tg cgc 
Phe Ala Thr Ala Glu Glu Gly Val Ser Arg Asn He Thr Val Arg 

2350 2355 2360 

g ? g ggC £ Cg a £ a a f g gtt cac ctg a 9a cca ttg gga aat eta 
Ala Gly Pro Thr Thr Val His Leu Gly Arg Pro Leu Gly Asn Leu 
2365 2370 2375 

ccg gee egg ace acg etc tec gga cgc etc ate ate gga gag gat 
Pro Ala Arg Thr Thr Leu Ser Gly Arg Leu He He Gly Glu Asp 
2380 2385 2390 

cgc gtt tac ggc cga eta act gaa gca cgc acg cca aag ggt aac 
Arg Val Tyr Gly Arg Leu Thr Glu Ala Arg Thr Pro Lys Gly Asp 
2395 2400 2405 

cgc ttc cct gta tgc atg cag ate att gag aac gac tct att ggg 
Arg Phe Pro Val Cys Met Gin He He Glu Asn Asp Ser He GlJ 
2410 2415 2420 

ttc aag aag gag ate aac gac ggc tec aac agc gec teg gtc tat 
Phe Lys Lys Glu He Asn Asp Gly Ser Asn Ser Ala Ser Val Tyr 
2425 2430 2435 

ccc tat ttt gac gtg aag gee gtc cgc cgc ttc gag tga accccoacca 
Pro Tyr Phe Asp Val Lys Ala Val Arg Arg Phe Glu acccc S a ^ a 
2440 2445 

ggagcaacat ggggctgaac caagtatcaa tgcagggttc cactctgcta cttcaaggtc 

tccgcgcatg ccc atg tea ccc ate gtt gtt etc ctg gtg ata gcg ctt 
Met Ser Pro He Val Val Leu Leu Val He Ala Leu 
2450 2455 

ttc tec ggg gca gee gec gcg caa teg cgt cca gca gee acg ggc 

?J«n r Gly Ala Ala Ala Ala Gln Ser Pro Ala Ala Thr Sly 

2460 2465 2470 

ttg ggg gta cgc egg att gag ttc teg gec gag aac ttc gag gca 

Leu Gly Val Arg Arg He Glu Phe Ser Ala Glu Asn Phe Glu £a" 

2475 2480 2485 

atg acg aca ccg gag gta cag ata agc ccg ggg gtt tec acc aca 

2490 ^ 2S5 Gln 116 Ser Pr ° Val Ser ?nr Thr 

ttc gag ttc aac tec gec etc ctt cag gag aag gtc gcg gtt aaa 
Phe Glu Phe Asn Ser Ala Leu Leu Gln Glu Lys Val ill vll Jlu" 
^ 505 2510 2515 



26159 

26204 

26249 

26294 

26339 

26384 

26429 

26474 

26523 

26583 
26632 

26677 

26722 

26767 

26812 



ggc 



gec gac cgc ttc teg etc gtg gac att gga 



cga agc act etc 



26857 
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Gly Ala Asp Arg Phe Ser Leu Val Asp lie Gly Arg Ser Thr Leu 
2520 2525 2530 



egg ctg gtg ccg teg gag caa gtc ctg 
Arg Leu Val Pro Ser Glu Gin Val Leu 
2535 2540 

gtg acg gtg egg ttc ctg gac gga gca 
Val Thr Val Arg Phe Leu Asp Gly Ala 
2550 2555 

ttc ate etc gtc gcg cac ccc get cag 
Phe He Leu Val Ala His Pro Ala Gin 
2565 2570 

gtg tgg cgc aac cag cgc acg gtg gag 
Val Trp Arg Asn Gin Arg Thr Val Glu 
2580 2585 



ccc ggc gaa cgc ctg cga 26902 

Pro Gly Glu Arg Leu Arg 
2545 

gcg ccg gta ggc atg gca 26947 

Ala Pro Val Gly Met Ala 
2560 

gcg gag cgc ttg gtc gag 26992 

Ala Glu Arg Leu Val Glu 
2575 

tec tat caa cag gaa teg 27037 

Ser Tyr Gin Gin Glu Ser 
2590 



aaa gaa gee cga gcg gag gee caa agg tgt cac gaa gag aac gag 27082 
Lys Glu Ala Arg Ala Glu Ala Gin Arg Cys His Glu Glu Asn Glu 
2595 2600 2605 

cgc ctg cgc gee gag tat gcg gga ccg ggc ggc etc get ggc etc 27127 
Arg Leu Arg Ala Glu Tyr Ala Gly Pro Gly Gly Leu Ala Gly Leu 
2610 2615 2620 



ctt gcg aac cgc gtg ate ggc aag teg 
Leu Ala Asn Arg Val He Gly Lys Ser 
2625 2630 

etc gac ttc gag aag gaa gtc cgc cag 
Leu Asp Phe Glu Lys Glu Val Arg Gin 
2640 2645 

agg att ctt cgt gca tgg age tac cgc 
Arg He Leu Arg Ala Trp Ser Tyr Arg 
2655 2660 

gtg etc atg gat ttg gac tac cca gaa 
Val Leu Met Asp Leu Asp Tyr Pro Glu 
2670 2675 

gcg cag gga gca teg ttg gtg age aag 
Ala Gin Gly Ala Ser Leu Val Ser Lys 
2685 2690 

atg etc cct gta tgg cag gaa gcg ccc 
Met Leu Pro Val Trp Gin Glu Ala Pro 
2700 2705 



ggc gtt teg gtc aag ccc 27172 
Gly Val Ser Val Lys Pro 
2635 

cgt cct gga gac gcg ate 27217 
Arg Pro Gly Asp Ala He 
2650 

tec gee aac cgc gtg tec 27262 
Ser Ala Asn Arg Val Ser 
2665 

gee gcg caa cgc tgg ate 27307 
Ala Ala Gin Arg Trp He 
2680 

aca ggt gaa acg ctg age 27352 
Thr Gly Glu Thr Leu Ser 
2695 

gtt gee aat gee egg ate 27397 
Val Ala Asn Ala Arg He 
2710 



egg cgc etc gtg gtg gaa gca gag get gag ccg gac gcg gee cag 27442 
Arg Arg Leu Val Val Glu Ala Glu Ala Glu Pro Asp Ala Ala Gin 
2715 2720 2725 



gac acg ttc ace etc aaa ctg tgg gaa teg aac ggg ctg cgc acg 27487 
Asp Thr Phe Thr Leu Lys Leu Trp Glu Ser Asn Gly Leu Arg Thr 
2730 2735 2740 



ate 
He 
2745 



acc eta 
Thr Leu 



tec ggg gtg 
Ser Gly Val 
2750 



aca ttc cca taa tgttgagaac atcctcacta 27537 
Thr Phe Pro 
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tcaagctagc acccacatag catgaagtcc gcgcccgtgc gtgtcatggg ccgttcaaac 27597 
acctgaaaac caataactac atccttacac catccaaacc aagccccatg tctacagccg 27657 
agcaaacgaa gaaaataatc aagaagctgc gagccaccct aaacgaagga agctcagaaa 2 7717 
agattcaagc actcatccag tccatcagac tggagaatcc atctcagtgg ctgaatttga 27777 
tcgccctcgc actgggcccc atagatccta ggaatccaac attcttcgaa aaagtatccg 27837 
caataaacag attggcagaa gatcttgcgc cagaagagca gcacacgcta cgcggatacg 27897 
aaagcatcgc aagggccatg cagatcgtat cgcaagagtc cgacaaatca gaattctgga 27957 
aatcttcgcg cctcacaatg atgtggcgcc tatttacgta cgttgaatcc caatacatat 28017 
ttgttagcaa tacgattcga aagaagctca agcgggcccc accgggctcg ccatttgtca 28077 
tggcgagccc ttcaatcgcg gttccaggaa ccgatgggac cgcaacgaca agtgtcgatc 28137 
ttgtttttga atctaccgcc gaagccgccg aactaatact ccgcatctcc atgcacagga 28197 
tcgacaccca acccccaaat gaaaagttta ccccccagac cccttatgac gatcctgact 28257 
ttggcaggct cctcgcttgc gcacacgcgt ggcatcgatt cagcctgcta tggaaccaaa 28317 
tccagtatac cgactggcga cctgagcatg atccacgcgg cttcatctgg acaccaaaaa 28377 
acaaggcgca atacctccaa agcgaaatcg gaaacattcg cgagcgccta tttttgcagc 2 8437 
aagcaatgct gtctgatgga ctcggtcctg gggcagaact ggagagtaga agagaagtcc 28497 
tggaacgcct caagaggagc ttcaatctcc cgaagccagg gaaaacatgg gaccttaaga 28557 
ttgatattaa agccctgcga gatctcctcg gtcttgatcg atatgccacg gctcgcaaga 28617 
tctacatgtc gcgtcgattc ctaaccccag taatgggagc gccagcagat cctgaactgg 28677 
agaagtcaca gaaggagttg aagttgaact tcacgctgca tgaactcgcc atggctctgc 2873 7 
tgcaaaggaa tcacgcaaca ttcaagacca agtcagcaat acacaacatt gtttttgcag 28797 
tcaccaagca gcgcctatgc aaggtaatca gcgaggcgtc gggcatcgaa ttcgaattct 28857 
gcttttccca tatcaacaac atcgtcttcg acgcacagag aaaaaaccta gagatctggg 28917 
accagccgct cgtaccagta gatgaccata gggtgttgtt cgttccttcg atcattgcca 28977 
attcccatcc tattcgcaca atagagcatt tagcttcagc tcgctcaagc tccgcgtttg 29037 
actcccgggg agagtattat gagcagatga ttcaatcgag gcttgaggaa ttaacacaag 29097 
cagtcgctgt gcggggcatc aaatttcctg cttcagatgg gcggcagatc gaatacgact 29157 
tggtgcttta ttggcaaggg cacctattta tcttcgagct taagtgcctg aagatgacaa 29217 
caactcctgt ggatgagttc ggggcgaggg aggagatcga gcgcgccgag caccaattgg 29277 
tacggcgtgt ttctattgcc gaaactgact ggaaggagtt caggagccaa gcacagggag 29337 
tttcactccc cgagactccg ccagcgcgca gaaacatcca tgccataata ctgaccaaca 2 9397 
tcatgaactt tacgggccga agaactggag agatcgtgac ggcagacgac tcctgcctgt 29457 
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ttagatactt 


cgacagccct 


catgttcccg 


tatttgatgc 


tgccacgggt 


gecgaaatet 


29517 


caagcattgc 


acgtttgcgc 


cttcgcaacg 


aggggccgcc 


atctgcttcg 


gggctttggc 


29577 


aatatttatc 


aagccctccg 


cagttggagt 


ggatgcggaa 


gcaaattcgg 


cctgaatacc 


29637 


acccactgcc 


accgctggat 


gagggcaact 


atcactggaa 


gctcatggtc 


gcattcgatc 


29697 


cggccgacgc 


ccagacgact 


aaatagtgtt 


ggtttggtct 


cgttgcgaag 


gtgagcattg 


29757 


acctcggctg 


gacgctggat 


tcagcgtcta 


caacaacgat 


attgcagggt 


gaggeggagg 


29817 


ctgcgtagcc 


ttcgtttaga 


acccgtggag 


aacccgaccc 


atg acg eta tac gaa 
Met Thr Leu Tyr Glu 


29872 



2755 



ctg ttt gaa gec gat gat gac cgc gat gtc ccc gec gac ttt cct 
Leu Phe Glu Ala Asp Asp Asp Arg Asp Val Pro Ala Asp Phe Pro 
2760 2765 2770 



29917 



tgg etc ttg ate gag ate cct gag gag caa etc cgc cat gcg gcg 
Trp Leu Leu He Glu He Pro Glu Glu Gin Leu Arg His Ala Ala 
2775 2780 2785 



29962 



ctt etc ctg aat gag ggt gag tgg cga ccc teg cac att tct ggc 
Leu Leu Leu Asn Glu Gly Glu Trp Arg Pro Ser His lie Ser Gly 
2790 2795 2800 



30007 



ate tgg tat egg gtg gat ccc gag cgc cca get cag aag cag cag 
He Trp Tyr Arg Val Asp Pro Glu Arg Pro Ala Gin Lys Gin Gin 
2805 2810 2815 



30052 



egg cac gtc cac gtc gec gcg aag aag cac ate aag ate ccc acg 
Arg His Val His Val Ala Ala Lys Lys His He Lys He Pro Thr 
2820 2825 2830 



30097 



aag cag get tea tgg aac aga gac ace ace cgc cat gac cgc aag 
Lys Gin Ala Ser Trp Asn Arg Asp Thr Thr Arg His Asp Arg Lys 
2835 2840 2845 



30142 



aca ttc aat gcg aag ctg ggc teg caa ggg age tat caa gac gtg 
Thr Phe Asn Ala Lys Leu Gly Ser Gin Gly Ser Tyr Gin Asp Val 
2850 2855 2860 



30187 



gcg aag acg gcg ctg ggc ctg ccg ccc gag get gtc etc gaa cac 
Ala Lys Thr Ala Leu Gly Leu Pro Pro Glu Ala Val Leu Glu His 
2865 2870 2875 



30232 



gtt gtg gaa aat ccg gec gaa caa cag atg ctg etc ace gaa tec 
VaJ Val Glu Asn Pro Ala Glu Gin Gin Met Leu Leu Thr Glu Ser 
2330 2885 2890 



30277 



act ga^ ccc cag ttg aaa act gag ttc tac agg tgg ttg gag gec 
Thr Asp Pro Gin Leu Lys Thr Glu Phe Tyr Arg Trp Leu Glu Ala 
2895 2900 2905 



30322 



agg egg tct aac ttc gaa gcg etc gtc gag gac gcg gtt 
"» ?ro Arg Ser Asn Phe Glu Ala Leu Val Glu Asp Ala Val 

2915 2920 



30367 



aat ctg ccg etc gac ctg gaa teg taa ggcagtggtt 
Arg Asn Leu Pro Leu Asp Leu Glu Ser 



30410 
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2925 2930 

aaatgcggtg atacgagcgg tgatacgagc gcctaacaaa actggcatag ggacccggct 30470 

tctcaaggac gcagctgcaa ccaaggcgat tttgtaggca ttctttgggg cccgagacag 30530 

cgcgcgggat ctccggcacc cagttcacgg acgtcaccgt ccccggtgca gccaatggcc 30590 

cctgacccct gccgtcccca ccctcgtacc caagctgtac ccaagatgaa cggacgcggg 3 0650 

tggacctcca tggacgctcc acccaccccg ctccagaggg aaaccggccg accctcaatc 3 0710 

cctccgggca cctcgcggat cgggggagcg gggcttcaaa accggtgagg ggcgctgaga 3 0770 

ggcgtccctg gtgagttcga ttctcatggg gcgacgtgaa cccggccgtg ctccatgcgc 3 0830 

tccacaacgc ggcggaagat gaggtcgata atggcgaggg actgagcctt caggagtggc 30890 

cggaacagac cagggccacg tcagaagcga acgacctccc tttcgtaacc gaataacgaa 30950 

acagaggcga ggatcacaac tttcgaaatc cgttgacgcc cgtgggcttc atttcaggaa 31010 

tcctgggaaa tgcctctcct tgctcttctt gggcttctcc atggtagcgc ttcgcgcacc 31070 

caggctctgg tactatccat catatggagc gctccctcgg aacgacagag caggtcttct 31130 

ataaggtacg gcagaagcgc cccttcaaca tcgtcattac gggcaagctg cgaggacgcg 31190 

tggaggagca ggctgtgcgc caggggttg gtg gcc ctg caa egg cga cac ccg 31243 

Val Ala Leu Gin Arg Arg His Pro 
2935 2940 

ctg etc cga gcg aag ate gtc gcg ggt act ccg cca cgc ttc acc 31288 
Leu Leu Arg Ala Lys lie Val Ala Gly Thr Pro Pro Arg Phe Thr 
2945 2950 2955 

Ser Clt v!? T C *!* ° tg 533 gtg gtg gaa c ^ c a *9 31333 

Ser Gin Gly Val Pro Pro He Ala Leu Glu Val Val Glu Arg Lys 

2960 2965 2970 

gca gag gag tec tgg caa cag cat gtc gag gca gag ctt gac agg 31378 
Ala Glu Glu Ser Trp Gin Gin His Val Glu Ala Glu Leu Asp Arg 
2975 2980 2985 

cct ttc aac teg gac cct ggg ccg etc gtc cgc ttc ate etc gtg 31423 
Pro Phe Asn Ser Asp Pro Gly Pro Leu Val Arg Phe He Leu Val 
2990 2995 300 0 

egg ggc gag cac cgc tec gag ctg ctg tgc age tat gat cat ctg 31468 
Arg Gly Glu H ls Arg Ser Glu Leu Leu Cys Ser Tyr Asp His Leu 
3 °05 3010 3015 

ate gga gac gcg cac tec ggc ate ttc gcc ctg cgc gat ctg ctg 31513 
lie Gly Asp Ala His Ser Gly He Phe Ala Leu Arg Asp Leu Leu 
3020 3025 3030 

Sin Sal 2? IT» c CC ~? g ga ° C ? g CaC ° tg CCC gag ctc * cc cca 31558 
Gin Val Met Ala Ser Gin Asp Gin His Leu Pro Glu Leu Ala Pro 

3035 3040 3 04 5 

agg ccc gcc tat gag gag ctg att ggg ccg atg gta ccg ggc acc 31603 
Arg Pro Ala Tyr Glu Glu Leu He Gly Pro Met Val Pro Gly Thr 
3050 3055 3060 
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2S tin T Ctt 2?° g ? C gCC gtg CgC Cgg ggc tcc tc ^ ff°sr ctg ctg 31648 
a i^eu jjeu Gly Ala Ala Val Arg Arg Gly Ser Ser Ala Leu Leu 
3065 3070 3Q75 

cgc ttg tct ccg acc etc aat acg etc aca gag egg etc gtc gca 31693 
Arg Leu Ser Pro Thr Leu Asn Thr Leu Thr Glu Arg Leu Sal £a 
3080 3085 3090 

ccg egg ggc atg get cgc age age gee gag gga cag gcg etc tac 317™ 
Pro Arg Gly Met Ala Arg Ser Ser Ala Glu Sly Gin 111 Leu Tyr 
3095 3100 310 5 

Ser nil 2£ tS* Zf* ° CC g&g Cag Ctc gcg cgg ctg ctt g cg 31783 

Ser His Arg lie Leu Glu Pro Glu Gin Leu Ala Arg Leu Leu All 

3H0 3115 ~ 3120 

Arg Ss Ara Slu ??* t CC o CC CaC gCa gCg CtC ggg gcc 31828 

Arg cys Arg Glu Gin Gly Ser Ser Val His Ala Ala Leu Gly Ala 

3125 3 130 3135 

gcg ctg ctg ata get egg gee gag teg cag ggt gee aag aag egg 31873 
Ala Leu Leu lie Ala Arg Ala Glu Ser Gin Gly Ala LyJ Lys Arg 
3140 3145 3150 

?S r CtC ^° T CtC ^ C tCt gCg ttg gat gct cgg gag cgc ttt 31918 
Val Thr Leu Thr Leu Thr Ser Ala Leu Asp Ala Arg Glu Arg Phe 
3155 3160 3165 

ggc gtg ggt gag gac ttc ggc ctg ttc acc acg ggg aag aca aac iiq« 
Gly Val Gly Glu Asp Phe Gly Leu Phe Thr Th? Gly lyl £? £p 319 " 

3170 3175 3180 

ttc ttc cgc gcg cgc ggc age acc cca ttc tgg gag ctg aca car T?nno 
Phe Phe Arg Ala Arg Gly Ser Thr Pro Phe tS SlS Leu K Sis 
3185 3190 3195 

aga etc cga teg cct ctg cag gca gca egg aag aag egg age cac 32053 
Arg Leu Arg Ser Pro Leu Gin Ala Ala Arg LyI LyJ A ?J Ser His 
3200 3205 3210 

ttg agg ctg ttc cgc ctg ate ctg gag ata tcc gcg ctc acc gtg 32098 
Leu Arg Leu Phe Arg Leu lie Leu Glu He Ser Ala Leu Thr Val 
3215 3220 3225 

S° o CC t C ° ° CC tgg atg gaa cgc gcg aca C G9 ctg ggc 32143 

Asp He Ser Ser Gin Pro Trp Met Glu Arg Ala Thr Arg Leu Slv 

3230 3235 3240 

ctg cac tcc atg etc gee ctg age aac gtg ggg cgt gtg aaa att ^?irb 
Leu His Ser Met Leu Ala Leu Ser Asn Val Gly Arg vll SS Se 
3245 3250 3255 

gee gec cgc tat ggg aac ctg acg ctc gag ggc ctt aac ttc *rrr 
Ala Ala Arg Tyr Gly Asn Leu Thr Leu Glu Sly Leu Sly p£l 
3260 3265 3 27 0 

ZZ g g ? g g f c cag ttc 3ac gfcg gtg ctg ac S gcg ate acc 32278 
Gly Thr Thr Ala Ala Gin Phe Asp Val Val Leu Thr Ala He Thr 

3275 3280 3285 

err gC c cag egg ttg gag gcc ag c ttt ctc ttc aac acc gcg cac 32323 
*ne *Aa am Arg Leu Glu Ala Ser Phe Leu Phe Asn Thr Ala His 
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til ^ *: gC g f g Cag gtg gag cag ctc gcc tc S c ^ c acc tgg gag 32368 
Met Gin Arg Glu Gin Val Glu Gin Leu Ala Ser Arg Thr Tr? Glu 
33 05 3310 3315 

Leu Leu Ala SS S! J£ Sg gCggCt3CtC cg ^gatg ttgtacatcc 32422 

3320 

gcttgacctc ttccagcggc agctccatga ccttccaggg atcccagtct ccgaagaggt 32482 
ccacgctcat cgcgctgcct cgctccaccg cacgccgcag ctggggaatg ggaagcgcct 32542 
cggcgagcag gcgaacgggc accaaggact tgtactgggc cggatcatag ttgcgggtgc 32602 
gggggccgaa catgaagccc aggctgaagg ccgtggtcag catttcctcc cagatggaga 32662 
tgccatagcc ggagagcagg tggacgaagt catgcacggc cgagtgcacg ggcagggcgt 32722 
gagggtgacc cggcagcggc acgccatggg agtggcagta cttccagaac tcgcggccaa 32782 
aggaaccctc gggcaggtgc tcgagctgct gaaagcgttg cttgaggtct ggcgcctcat 32842 
tgcgctcgcg cagcatgtca tagagcttcc agggcagctc gcccttcggc agccccaaca 32902 
tgcggcagaa gtactcaata taggcaggca cgaggaaggc ggtcagtcca acggacagcc 32962 
acagcttgga gcgcgcgtcc gcggccaggg aacgctcgcg cggtccaggg gctcgctggt 33022 
acttccgcag aaatacatcc gtgagtattc tcattcgaac accttggctc gttatacggc 33082 
tggggcagct ttcaggcagg tcgctgtctc ctggagaaag ccatacggat ccttggggtc 33142 
ctcggccatg gactccacct ggagcaggcg gcctaccccc agccgctcgg cgtgcgccag 33202 
gatgacttcc agcaccacct tgtcctccaa ggaaaacccg gtggagtcga agacggtgag 33262 
gccatcgcgg tgagagcgcc actgctcggg agagcgggtc agttcaaaga ggctggggcc 33322 
aatctctcca ggctcgagct gctggcactc cccctccacc acggcctggg ggagatagtc 33382 
cgggcatacc agggcgcgac gcagcatctc caggggcagc tctgtcttgc cgggcagatc 33442 
tgagcccacg gcgttgatat ggaggtgggc cttgatcaga cgcccggaca ccaccggccc 33502 
cgcgcccacg gcgacggagg tggccgtgca gaggatgtct gcctcctgct ccacgcgcgt 33562 
cagggggacc gccaccacct ccaggcccag gaaggccaca cgccgggcaa agcttcgcat 33 622 
ggcttcggga gcaatgtcgt gcaccagcac ccgtcccaga ggaaagacgc gcgagagcgc 33682 
atgtgcctgg gtgacggcct gagcgccaca ccctaccagg cccaataccc ggctgtcctc 33742 
ccgcgcgagc agccgactcg ccacggcgga ggccgccccg gtgcgtagcg cggtaggaaa 33802 
ggtgccatca caaacggccc gcaggtgccc ggtcgtcaca tcatacagac tgagggagga 33862 
aagaatggtg ggcaagccca gctcctccgg gctgcggggg ttgtagccca ccagcttgat 33922 
cacggcgtgg cgccccacct gcatgacggg catccactcc aacacgccgg ggtgggggtg 3 398 2 
ctgatactgg aagccctcgc gcttgcgcag ctcggtgttg gcggggtcga agtgctccag 34042 
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ctcgtactcg agggtggaga tgacctcatc catcagcgcg tgcagccccg ttgcacggac 34102 

gatctggcgc agttcctgct cacccaggat ccttgtcatc cgcatggctg accccgcccc 34162 

tttcagctgc atgacaggcc tcactgcatt cacggctcca gttcagggcc gggcaaaatc 34222 

tatcttcaac atccccttgt ccaggaccgc gtccacggca tccagctgcg ccaccgagcg 34282 

cggcagcatg acgtggcggc ggaagccgcc cacgcggatg atgaggtcat tgccctggcg 34342 

atccaggaca atgtctccgc ggtcggcacc tggcatgagg agctgcagcc gatagcggcc 3 4402 

ctcgtgctgc ttggcgaact gatagggctg cccggagatg tagcgctcgg tggggtccga 34462 

gtccccgtag agccgctgcg ccagatcctc caaccgctcc agccccacca cctcgtgctc 34522 

cagcagcggg agcctggcca ccggcatggg ggagaagtac tcctggatgt gatccaggta 34582 

ggcagattga gacttgtgcc actgagcaaa gtattccgtg tccggcagca gacgattgac 34642 

cacgacctgg tccaccgtca tgccatacat gttgaagtag aggtaggccc gctgtgtctc 34702 

gcggatgacc attttttccg cgctggagac caatcgcacc gtggtgcgat tgccatccgt 34762 

gagcaaccct tcaatgccct tgacgcgatc gaagagctgc tcgagggagg caaagtagct 34822 

gtcctccggc agatcatagc tcgtgagccg actggccagg ggccgggcca gcttgacgag 34882 

cgtgcgatcc acattgaagc gccggcggat ataccactgc agcacgctgg tgatgttgac 34942 

gaagcgcagc gactcgcccg tgggcggaca gtccacgatg atcacgtcat agcgcccctg 35002 

ctgcacgtac tggttcaggt agatgagcga gatgacgtcc tccgtgcccg ggagaatggc 35062 

gacctcctcg gccaccatgt tggatacccc ggtggaggtc agcagcacgg acatgtagtt 35122 

atagacttca ctccactgcc gctggagttc gtgctggatg tcgatctctt gcagttcaag 35182 

gttgggggcg accttctggg gcagtccctc gttgaagtcg aagagcttgc gatcgagatc 35242 

gaagctgtcc gagaggctgt gggcaatgtc gaaggacatc accaaggtgc ggtagccgcg 35302 

cttggcagcg gccaccgcgg tggccgcgga tgccgtcgtc ttgcctaccc cacccttgcc 35362 

agagaaaaga agaattcttg tcatggatgc cccgcgtgcc tgccggcaac gggccgccat 35422 

ggcgctctcc ccctccatgg gtcacctgtt gcgtctgtgg aactcatcac tatcagcgac 35482 

atgccagaag gagcaagctg gacggtggct tgacttggt atg acg ctg gac agt 35536 

Met Thr Leu Asp Ser 
3325 

agg tat gcc aat aga ggt ggg ggg ctt tat gaa ate gat gga ttg 35581 
Arg Tyr Ala Asn Arg Gly Gly Gly Leu Tyr Glu He Asp Gly Leu 
3330 3335 3340 

get ate ggg cae age ccc ccc ect tgt tea att cca gga ggg gtg 35626 
Ala He Gly His Ser Pro Pro Pro Cys Ser He Pro Gly Gly Val 
3345 3350 3355 

caa ttg aca gcc aca gcg act gcc gag cae gcg ctg aaa gcc etc 35671 
Gin Leu Thr Ala Thr Ala Thr Ala Glu His Ala Leu Lys Ala Leu 
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3370 



egg gec aag gga etc gec acg 
Arg Ala Lys Gly Leu Ala Thr 
3375 3380 

egg cac gec age ggc gaa aag 
Arg His Ala Ser Gly Glu Lys 
3390 3395 

cgc atg tgg ttc ctg gag cag 
Arg Met Trp Phe Leu Glu Gin 
3405 3410 

cat ttg ttg caa agt cat cac 
His Leu Leu Gin Ser His His 
3420 3425 

ggg ttg egg egg gcg ttg ggg 
Gly Leu Arg Arg Ala Leu Gly 
3435 3440 

etc egg ctg gtg gtt gee att 
Leu Arg Leu Val Val Ala He 
3450 3455 

gtg egg ccc gag tgg tgg ccc 
Val Arg Pro Glu Trp Trp Pro 
3465 3470 

ggg gtc gec cag gcg gac agg 
Gly Val Ala Gin Ala Asp Arg 
3480 3485 

cgc gtg gag get gcg gcg ccc 
Arg Val Glu Ala Ala Ala Pro 
3495 3500 

ttc cga gtc egg ctg gcg cga 
Phe Arg Val Arg Leu Ala Arg 
3510 3515 



teg gac gag gcg ctg 
Ser Asp Glu Ala Leu 
3385 

get ccc etc tea tec 
Ala Pro Leu Ser Ser 
3400 

etc gag cca ggc aac 
Leu Glu Pro Gly Asn 
3415 

ata cag ggg cct ttg 
He Gin Gly Pro Leu 
3430 

etc atg gtc aag cgc 
Leu Met Val Lys Arg 
3445 

ggc gcg gcg ggc cct 
Gly Ala Ala Gly Pro 
3460 

gag ctg cct etc age 
Glu Leu Pro Leu Ser 
3475 



ccc gcg gtg 
Pro Ala Val 



tea cag egg 
Ser Gin Arg 



gcg gee cag 
Ala Ala Gin 



cag gta gag 
Gin Val Glu 



cat gag gcg 
His Glu Ala 



gag cag cgt 
Glu Gin Arg 



gat ctg age 
Asp Leu Ser 



get egg 
Ala Arg 



ttc aac 
Phe Asn 



ctg gcg 
Leu Ala 



ctg gtc ace ctg cat cac ctg 
Leu Val Thr Leu His His Leu 
3525 3530 

gag gtg .etc ate aaa gag ctg 
Glu Val Leu He Lys Glu Leu 
3540 3545 



gaa ggc 
Glu Gly 
3555 

gac tac 
Asp Tyr 
3570 

ttg gag 
Leu Glu 
3585 



teg age 
Ser Ser 



get ggc 
Ala Gly 



gee tgg 
Ala Trp 



ccg gag ctg 
Pro Glu Leu 
3560 

tgg gag gca 
Trp Glu Ala 
3575 

tgg egg cag 
Trp Arg Gin 
3590 



ate teg 
He Ser 



gcg atg 
Ala Met 



gcg cca 
Ala Pro 



age ttc 
Ser Phe 



egg etc 
Arg Leu 



gee ctg gcg 
Ala Leu Ala 
3490 

ctg cag caa 
Leu Gin Gin 
3505 

gaa acg gag 
Glu Thr Glu 
3520 

gat gga gee 
Asp Gly Ala 
3535 

etc tac ggc 
Leu Tyr Gly 
3550 

ctg ccc gtc 
Leu Pro Val 
3565 

gcg ccc gcg 
Ala Pro Ala 
3580 

gec ggg gtg 
Ala Gly Val 
3595 



gag ctg gcg 
Glu Leu Ala 



ggc cca ctg 
Gly Pro Leu 



cat gtg ctg 
His Val Leu 



tgg age tgc 
Trp Ser Cys 



cag cac gtg 
Gin His Val 



caa tac agg 
Gin Tyr Arg 



gga gag gec 
Gly Glu Ala 



ccc acg gtg 
Pro Thr Val 



35716 



35761 



35806 



35851 



35896 



35941 



35986 



36031 



36076 



36121 



36166 



36211 



36256 



36301 



36346 



ctg gag ctg ccc acg gag gga gcg cgt ccc etc aga cag acg tac 3 6391 
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Gin Thr Tyr 



egg gca ggg egg gtg gec ate 
Arg Ala Gly Arg Val Ala lie 
3615 3620 



acc gtg cag ccg cgc 
Thr Val Gin Pro Arg 
3625 



ctg cgc cag 
Leu Arg Gin 



36436 



gcg ctg gag gac ctg gcg cac 
Ala Leu Glu Asp Leu Ala His 
3630 3635 



aag gag ggc gtg tea 
Lys Glu Gly Val Ser 
3640 



etc ttt gec 
Leu Phe Ala 



36481 



ctg ctg etc acc gee ttc acc 
Leu Leu Leu Thr Ala Phe Thr 
3645 3650 



acg ctg eta cac cgc 
Thr Leu Leu His Arg 
3655 



tac tct cgc 
Tyr Ser Arg 



36526 



cag gag gaa ttg gtg gtg ggg 
Gin Glu Glu Leu Val Val Gly 
3660 3665 



tgg ccc gec ccg cag 
Trp Pro Ala Pro Gin 
3670 



cgc ccc egg 
Arg Pro Arg 



36571 



ccc gag ctg cat ggc etc att 
Pro Glu Leu His Gly Leu lie 
3675 3680 



gga tac ttc ggc acc 
Gly Tyr Phe Gly Thr 
3685 



ccg gtg gca 
Pro Val Ala 



36616 



ttg cgc teg egg ctg gag cct 
Leu Arg Ser Arg Leu Glu Pro 
3690 3695 



cac acc egg gtg egg 
His Thr Arg Val Arg 
3700 



gag gcg ctg 
Glu Ala Leu 



36661 



egg cag etc gat cag gag gtg 
Arg Gin Leu Asp Gin Glu Val 
3705 3710 



cgt gag gec aac gee 
Arg Glu Ala Asn Ala 
3715 



cat gcg gcg 
His Ala Ala 



36706 



etc ccc ttc gag egg ctg gtg 
Leu Pro Phe Glu Arg Leu Val 
3720 3725 



age ctg eta gac att 
Ser Leu Leu Asp lie 
3730 



gee cgc age 
Ala Arg Ser 



36751 



ccc age cgc cac ccg ctg ttc 
Pro Ser Arg His Pro Leu Phe 
3735 3740 



cag gtg ctg ttc gac 
Gin Val Leu Phe Asp 
3745 



ctg ctg ccg 
Leu Leu Pro 



36796 



gag cag ccc cag cct acc get 
Glu Gin Pro Gin Pro Thr Ala 
3750 3755 



ggc ggc tgc gec ttc 
Gly Gly Cys Ala Phe 
3760 



egg ccc tgg 
Arg Pro Trp 



36841 



gag tec ttc acc ggc ctg gtg 
Glu Ser Phe Thr Gly Leu Val 
3765 3770 



gcg tat gat etc act 
Ala Tyr Asp Leu Thr 
3775 



ctg ctg ctg 
Leu Leu Leu 



36886 



gag ccg cgc ggt gag ggg ctg 
Glu Pro Arg Gly Glu Gly Leu 
3780 3785 



gag ggg gcg ctg gac 
Glu Gly Ala Leu Asp 
3790 



tac age gcg 
Tyr Ser Ala 



36931 



ctg ttc age gag gcg cgc 
Asp Leu Phe Ser Glu Ala Arg 
3795 3800 



atg cag cgc gee gee 
Met Gin Arg Ala Ala 
3805 



act cag tac 
Thr Gin Tyr 



36976 



ctg cac ctg ctg gag cag etc 
Leu His Leu Leu Glu Gin Leu 
3810 3815 

Ley ctg gcg etc etc acg 

aci* Arg Leu Ala Leu Leu Thr 
— 25 3830 



gtg gag egg cct cac 
Val Glu Arg Pro His 
3820 

cct ggt gag cac gag 
Pro Gly Glu His Glu 
3835 



gag agg etc 
Glu Arg Leu 



get ttg ctg 
Ala Leu Leu 



37021 



37066 
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gcg ggg cac ggc ctg ggc ggg ccc gtg gag ggc gcg cag gtg ctg 
Ala Gly His Gly Leu Gly Gly Pro Val Glu Gly Ala Gin Val Leu 
3840 3845 3850 



37111 



gcc cac cac cgc ttc gag cac cag gtg egg etc acc 
Ala His His Arg Phe Glu His Gin Val Arg Leu Thr 
3855 3860 ~ 3865 



ccc cat cac 
Pro His His 



ccc gcg ttg tgc ttc ggc ccg cag gtg etc tec tac gag cag etc 
Pro Ala Leu Cys Phe Gly Pro Gin Val Leu Ser Tyr Glu Gin Leu 
3870 3875 3880 

a« ^ C g ?° ° Cg Ct ° gCC ° aC cgg ctg cgg c ^ c ct & 

Asn Arg Arg Ala Asn Pro Leu Ala His Arg Leu Arg Arg Leu Gly 

388 5 3890 3895 



gcg ggc 
Ala Gly 
3900 

gag ctg 
Glu Leu 
3915 

ttc ctt 
Phe Leu 
3930 



ccg gac acc ctg gta ggg ctg tgc gtg gag cgc tec ctg 
Pro Asp Thr Leu Val Gly Leu Cys Val Glu Arg Ser Leu 
3905 3910 



ccg gtg gca ctg ctg gcc ata tgg aag gcc 
Pro Val Ala Leu Leu Ala He Trp Lys Ala 
3920 3925 



ggg gcc ggc 
Gly Ala Gly 



ccg ctg gac gtc aat cag ccc cgc gag egg etc gcc ttc 
Pro Leu Asp Val Asn Gin Pro Arg Glu Arg Leu Ala Phe 
3935 3940 



etc ctg gga gat gcg age tgc cgc ate etc etc acc 
Leu Leu Gly Asp Ala Ser Cys Arg He Leu Leu Thr 
3945 3950 3955 



cag gag cac 
Gin Glu His 



ctg etc cag cgc ctg ccc ccc acg aac gcc gcg etc ctg tgt ctg 
Leu Leu Gin Arg Leu Pro Pro Thr Asn Ala Ala Leu Leu Cys Leu 
39 60 3965 3970 



gag agg 
Glu Arg 
3975 



gag gcg gag gca ctg gag cgc gag ccc cag gag gat gcg 
Glu Ala Glu Ala Leu Glu Arg Glu Pro Gin Glu Asp Ala 
3980 3985 



cca cac gag gcc ggg etc gac aac ctg gcc tac gtc ate cac acc 
Pro His Glu Ala Gly Leu Asp Asn Leu Ala Tyr Val He His Thr 
3990 3995 4000 

tct ggc tec acg ggc acg ccc aag ggc ate gcg atg gtg cat cgc 

SSL Thr Gly Thr Pro Lys Ile Ala Met Val His Arg 

4005 4010 4015 

tgt ctg gcc aac ctg gtg gcc tgg cag etc acc cat gag egg ctg 

YS ^nnn ASn LeU Val Ala Trp Gln Leu Thr His Glu Arg Leu 

4020 4025 4030 

ggt ggc ccc tea cgc acg etc cag ttc gcc teg etc aac ttc gac 

Y ?no C SSr Arg Thr LeU Gln Phe Ala Ser Asn Phe Asp 

4035 4040 4045 

ate tgc tac cag gaa etc ttc acc acc tgg gcc get ggg ggc acc 
Ile Cys Tyr Gln Glu Leu Phe Thr Thr Trp Ala Ala Gly Gly Thr 
4050 4055 4060 

fr fc ? 5*? S fc ? ^ g gaa gag gtg cgC agg gat cct 9oa egg ctg 

Val YnL Met Val Thr Glu Glu Val Arg Arg Asp Pro Ala Arg Leu 
4065 4070 4075 



37156 



37201 



37246 



37291 



37336 



37381 



37426 



37471 



37516 



37561 



37606 



37651 



37696 



37741 



37786 
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ctg gaa gtg ctg gag cag gag cag gfcg age egg ctg tac ctg ccc 

Leu Glu Val Leu Glu Gin Glu Gin Val Ser Arg Leu Tyr Leu Pro 
4080 4085 4090 

ttc ate gee ctg cag cag etc gcg egg gtg gcg gat gag cgc ggc 

™ a LeU Gln Gln Leu Ala ^9 Val Ala A SP Glu Arg Gly 
4095 4X00 410 5 

gcg gee cct cgc cac ctg cgc cag etc att acc gee ggg gag caa 

Ala Ala Pro Arg His Leu Arg Gln Leu He Thr Ala Gly Glu Gln 
4110 4115 4120 

etc cag gcg acc ccc gag ttg cag egg ctg etc teg egg atg ccg 

Leu Gln Ala Thr Pro Glu Leu Gln Arg Leu Leu Ser Arg Met Pro 
4125 4130 4135 

gag tgc acc etc cac aac cag tac ggc ccc tec gag tgc cac gtg 

Glu Cys Thr Leu His Asn Gln Tyr Gly Pro Ser Glu Cys His Val 
4140 4145 415Q 

^ g ga ° Ctg aCa Cgc gag ccg a ^ c c S fc <=ca egg 

Val Thr Ser His Asp Leu Thr Arg Glu Pro Ser Arg Trp Pro Arg 
4155 41 60 4165 

ctg ccg ccc gta ggc egg cca ctg gee cac etc egg gtg ctg ttg 

ff?« Pr ° Val Gly *= g Pr ° Leu Ala His Leu Arg Val Leu Leu 
4170 4175 418Q 



ctg gac ggg gag cag cag etc gtg cct ccg ggt gtg 
Leu Asp Gly Glu Gln Gln Leu Val Pro Pro Gly Val 
4185 4190 4195 



gee gga gag 
Ala Gly Glu 



fr ? r 3 ggc ggc ccc gcc ctg gcg C 9 C firer° tac ctff cgt 

™ LeU Gly GlY Pro Ala Leu Ala Ar g Gly Tyr Leu Gly Arg 
4200 420 5 4210 

ccg gag cag acg get gat cgc ttc gtg ccc gat cca ttc tec cgc 
Pro Glu Gln Thr Ala Asp Arg Phe Val Pro Asp Pro Phe Ser Arg 
4215 4220 4225 

gag cct gga gcc egg etc tac cgc acc ggg gac ctg gcg cgc etc 
Glu Pro Gly Ala Arg Leu Tyr Arg Thr Gly Asp Leu Ala Arg Leu 
4230 4235 4240 

cga gag gac gga gcg ctg gag ttc ctg caa egg atg gac gcg cag 

g ™ ASP Gly Ala Leu Glu Phe Leu Gln Met Asp Ala Gln 

4245 42 50 4255 

gtg aag ate egg ggc tac cgc ate gag cct ggg gag ate gag gtg 
Val Lys He Arg Gly Tyr Arg He Glu Pro Gly Glu He Glu Val 
4260 4265 4270 

gtg ctg tgc gag cac ccc gcc gtg cac caa gcg cac gtg egg ccc 
Val Leu Cys Glu His Pro Ala Val His Gln Ala His Val Arg Pro 
4275 4280 4285 

tar g-g gac age gcc gga gag egg cgc ctg gtg gcc tac gtg gcc 
ryr Val Asp Ser Ala Gly Glu Arg Arg Leu Val Ala Tyr Val Ala 
4290 4295 4300 

gca egg etc gag gac acg gac ggg gcg gag aca gag cac gtg gag 
a±s. Arg i,eu Glu Asp Thr Asp Gly Ala Glu Thr Glu His Val Glu 



37831 



37876 



37921 



37966 



38011 



38056 



38101 



38146 



38191 



38236 



38281 



38326 



38371 



38416 



38461 



38506 
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4310 



4315 



cgc tgg cgc gcg gtg tgg gat 
Arg Trp Arg Ala Val Trp Asp 
4320 4325 



gag acc tat ggc ggc 
Glu Thr Tyr Gly Gly 
4330 



cca age ggc 
Pro Ser Gly 



acg gec 
Thr Ala 
4335 



ttt gac ctg gec ggg tgg aat gac age gtg cgc ggc gag 
Phe Asp Leu Ala Gly Trp Asn Asp Ser Val Arg Gly Glu 
4340 4345 



ccg ctg cct ccg gag cag atg cga gag tgg gtg gag acg aca gtg 

Pr ° Pr ° Glu Gln Met Arg Glu Tr P Val Glu Thr Thr Val 
4350 4355 4360 



gag egg etc atg gag etc gtt 
Glu Arg Leu Met Glu Leu Val 
4365 4370 



ccc egg egg gtg ctg 
Pro Arg Arg Val Leu 
4375 



gag ctg ggc 
Glu Leu Gly 



22 o ?? g ctg ctc cgc cgg ctc gcc ccc cac tgc w 

° ly LeU Leu Leu Arg Arg Leu Ala Pro Arg Cys Glu 
4380 4385 4390 

tec tac tgg ggc acg gag ctc tec ccg gta gcc gtc gag egg ctg 

Ser Tyr Trp Gly Thr Glu Leu Ser Pro Val Ala Val llu A^g Leu 
4395 4400 4405 

agg gag caa ctc cag acg ggc ggc tec ccg ctc gcg cag cgc gtg 

g Gln Thr Gly Gly Ser Pro Leu Ala Gin Arg Val 

4410 4415 442Q 



egg ctc atg gcc cag ccg gcg gat gac ttc tec ggg 
Arg Leu Met Ala Gln Pro Ala Asp Asp Phe Ser Gly 
4425 4430 4435 



ctt ccg gaa 
Leu Pro Glu 



gcc ggc ttc gac acc gtc ate ctc aac teg gtg acg cag ctc ttc 

«Y« PhG ASP Thr Val Ile Leu Asn Ser Val Thr Gln Leu Phe 
4440 4445 445Q 

Pro £r S? T C ^f C T Ctg Ctg gtC gtg gag ggC gcg ctg c ^ 

P ^ r LeU Leu Arg Val Val Glu G1 y Ala Leu Arg 
4455 4460 4465 

Sf r tC CCg g ? C gga aCC Ctg fctc att g gc gac gtg cag aat 

n Pr ° GlY Gly Thr Leu Phe Ile G1 Y A ^P Val Gln Asn 
4470 4475 4480 

ctg cgg etc ttc gag ctg ttc cat gcc teg gtg gcc ctg gag cag 
Leu Arg Leu Phe Glu Leu Phe His Ala Ser Val Ala Leu Glu Gln 
4485 4490 4495 

gcg tea gcg gac ctg gag gcg cca gcg ctg ctc gcc cgc acc cgc 
Ala Ser Ala Asp Leu Glu Ala Pro Ala Leu Leu Ala Arg Thr Ara 
4500 4505 4510 



cag cgc 
Gln Arg 
4515 



atg ctg ctg gac gaa 
Met Leu Leu Asp Glu 



cgg ttg tac gtg gac ccg gac ttc 
Arg Leu Tyr Val Asp Pro Asp Phe 
4520 4525 



SK T tC g ? C ttfc CCC Cag ctg ggc gcg gtc cgg 

Phe Ala Ala Leu Ala Thr His Phe Pro Gln Leu Gly Ala Val Ara 

4530 4535 4540 



38551 



38596 



38641 



38686 



38731 



38776 



38821 



38866 



38911 



38956 



39001 



39046 



39091 



39136 



39181 



ctc aag cgc ggg age ggg a gg aac gag atg aac cgc ttc 



39226 
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Leu His Leu Lys Arg Gly Ser Gly Arg Asn Glu Met Asn Arg Phe 
4545 4550 4555 



cgc tac 
Arg Tyr 
4560 

ccc gcg 
Pro Ala 
4575 



gac gtg gag ctg cag etc gcc ccc gta gcg aaa gca ggc 
Asp Val Glu Leu Gin Leu Ala Pro Val Ala Lys Ala Gly 
4565 4570 



caa gag ctg cca gag ctg gac tgg agg cat 
Gin Glu Leu Pro Glu Leu Asp Trp Arg His 
4580 45 8 5 



ggc ctg gaa agg ctg gag cgc 
Gly Leu Glu Arg Leu Glu Arg 
4590 4595 



atg ctg gcg gag cga 
Met Leu Ala Glu Arg 
4600 



gag ggg etc 
Glu Gly Leu 



ccg gcg ggg 
Pro Ala Gly 



ctg gtg ctg cgc aat gtg gcc aac gcc cgc acg gcg gac gag gcg 
Leu Val Leu Arg Asn Val Ala Asn Ala Arg Thr Ala Asp Glu Ala 
4605 4610 4615 

gca egg ctg gcg ctg ctg egg act ggc aac age gtg ggc egg ctg 
Ala Arg Leu Ala Leu Leu Arg Thr Gly Asn Ser Val Gly Arg Leu 
4620 4625 4630 

egg gca ctg ccg gcg gtg aca tec tgg aat cca gag cag etc tgg 
Arg Ala Leu Pro Ala Val Thr Ser Trp Asn Pro Glu Gin Leu Tro 
4635 4640 4645 



agg ctg gcg gag gca gcc gac 
Arg Leu Ala Glu Ala Ala Asp 
4650 4655 



tac acg tgc cac gtc acc tgg age 
Tyr Thr Cys His Val Thr Trp Ser 
4660 



gcc cag 
Ala Gin 
4665 



gac gag gaa ggc cgt ttc gac gcg ctg etc atg gcc cgc 
Asp Glu Glu Gly Arg Phe Asp Ala Leu Leu Met Ala Arg 
4670 4675 



gcg get ggc tec teg cgc ccg gcg gcc tgg ctg acg cct ccc ccg 

Ala Ala Gly Ser Ser Arg Pro Ala Ala Trp Leu Thr Pro Pro Pro 
4680 4685 4690 

cct ccc cct cga ccc tgg aag age tac gcc aac cag ccc ctg gcg 

Pro Pro Pro Arg Pro Trp Lys Ser Tyr Ala Asn Gin Pro Leu Ala 
4695 4700 4705 



gcc age cgc egg cgc acc etc 
Ala Ser Arg Arg Arg Thr Leu 
4710 4715 



gtg gga gtg ctg cga age cac etc 
Val Gly Val Leu Arg Ser His Leu 
4720 



gag cac 
Glu His 
4725 



aag ctg ccc gag tac atg gtg ccc tct tec ttc gtg ctg 
Lys Leu Pro Glu Tyr Met Val Pro Ser Ser Phe Val Leu 
4730 4735 



ctg gac gcg ctg ccg etc aag ccc acc ggc aag ttg 
Leu Asp Ala Leu Pro Leu Lys Pro Thr Gly Lys Leu 
4740 4745 4750 



gag gtg gcg 
Glu Val Ala 



gca ctg 
Ala Leu 
4755 



cct ccc ccc gag ccc gca caa gcc gag caa acg cca ggc 
Pro Pro Pro Glu Pro Ala Gin Ala Glu Gin Thr Pro Gly 
4760 4765 



cac etc gcg ccg egg acc ccc 
His Leu Ala Pro Arg Thr Pro 
4770 ~ 4775 



acc gag cgc egg ctt gcc gag ctg 
Thr Glu Arg Arg Leu Ala Glu Leu 
4780 



39271 



39316 



39361 



39406 



39451 



39496 



39541 



39586 



39631 



39676 



39721 



39766 



39811 



39856 



39901 
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tgg egg cgc gtg etc ggc gtg cat cgc gtg ggg gt g gag gac aac 39946 

^ v£L Arg Val Leu Gly Val His ^ Val Val Glu Asp Asn 

4785 4790 4795 

tt C Sf 9 ° tg 9gg ggc Cac tcg ctt ct S ^ cc aca egg etc tt g 39991 
, Gln Leu Gly Gly His Ser Leu Leu Ala Thr Ar g Leu Leu 
4800 4805 4810 

tcg etc ate cgc gga g ag ctg ggg ctg gag ctg ccc ctg egg gt g 40036 
Ser Leu lie Arg Gly Glu Leu Gly Leu Glu Leu Pro Leu Arg Val 
4815 4820 4825 

2u 111 ni 9 ^ Ctg gCg gCC atg gCg ggt fc ff fc ctc W 4 0081 
Gln Pr ° Thr LeU Ala Ala Met Ala G1 y Cys Leu Glu 
4830 4835 484 0 

get cag age tgg age acc cag get ccc cac cct ccc tct gee tec 40126 
Ala Gln Ser Trp Ser Thr Gln Ala Pro His Pro Pro Ser Ala Ser 
4845 4850 485 5 

ate gag gag gga gaa ctt tga gcgccggagc gttgctcgcc cacgccgcga 40177 
He Glu Glu Gly Glu Leu 
4860 

gcctgggtgt acgcctgtgg gtagagggcg agegactgeg cttccaggct cccccagggg 4023 7 

tgatgacgee egagctgeag tcccgcctgg g aggcgctcg ccatgagctc atcgcactgc 40297 

taeggcaget ccagccctcc tctcagg gag gcagcctcct ggccccagtg gctcgaaacg 40357 

ggcggctggc gctctccttc gcccagcagc ggctgtggtt ccaggagcag ctccaccctg 40417 

aggccccggc caacaacctc aegggggecg tggtgttcac cggcccgctg cacgtggccg 40477 

cgctgctcgg ggcggt gg ct gcgctggtga ggegacaega agcgctgcgc accaccctgg 40537 

gcgaggaggg cggagtcccc tattegctea tcggcgagcc gtggcagccg gcgctggagg 40597 

tggaageget accgggtgcc accgtgggcg agcggctgga gcaggcccga gaggtggccc 40657 

tggccgagtc ccggcgccgc ttcgccctgg agaeggagee ccacctgcgc g t gCg cct g c 40717 

tgcgcctggc cgagcagcag cacgtgctgg tgctcagcct ccaccacatc gcggcggatg 40777 

gagteggget geaggtgetc gaacaggagc tggeggeget ctacggcgcg ctgagcgctg 40837 

gggccgagcc gcggctacct cccctgcccc tgcaggtggc ggacctt g cc gactggcagc 40897 

gccggtgggt ggagggagag gagtaccagg tccagctcgc ttactggcgg cgacagctcg 40957 

ccgggctcac gcccctg g ag g taccag g ag atcatcctc g cccgcgtatc ccctccatga 41017 

gaggegegga ggtacgggcg cccctgctct ccgctccgca ageccaggtg ctgcgcgcgc 41077 

tggggcaagg cgagggcgcc accctctaca tgacgctcct ggccgccctg ggcgtgctgc 41137 

tgcaacggtg gaegggecag cacgatatgg cggtgggcag cgccgccgcc aaccgcaacc 41197 

gcccgggctt ggagggcatc ctcggcttcc tcctcaacat cgtcctgctg cggctcgact 41257 

tgagaggccg cccccgcttc cgcgagttgc tgegtcagge aegcagggtg tgtgtggagg 41317 

cctatgccca tcaggagttg cccttcgagc acctggtgga ggcgctccag ccaggcagcg 41377 
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agcgcgggga cagctcgctc tatcgcgtgg cgctcgcggt gagcgacacg ccatggatgc 41437 

ccggccacgg gctgaagctg gagggggtgc aagcgcagcc gctcgacttt ccccgggggg 41497 

tgctggacct ggatctccac ctgtgggtgt atgacaccgg ggaggggctc accgggcggt 41557 

tggagtacgc cgtggatctg tatgaggagc ccacggcaag gcggctcctg gagggcttcc 41617 

gccaggtgct ggaggcggtg gtggaggcgc cggatcggcc cgtgccggag ctgccggtgc 41677 

tgggagagca ggagcgccac caggtgctct ccgggtggaa ccgcacacag cgcccctatc 41737 

cgcgagaggc cagcgttcac ggcctctttc agcagcgtgc cctccaggcc ccgcgggccg 41797 

tggcagtggt gtatggcgag cgcagcctca cctacggcga gttggcggag cgggcccgcg 41857 

ggctcgcaca gggtctggtg gcccgaggcg tgcggcgagg agacctggtc gcccttcggc 41917 

tggagcgctc gccggagcag gtggagtcca tgctggccgt actccaagcc ggggccgcct 41977 

acgtgccgct ggatcctt cc taccccgtac agcgccagga gttcatgctc caggacagcg 42 037 

gcgcgcggtt gctcgtgcac agcggcccgc tgcccttcgc tccccagggc tgcgccacgc 42 097 

tggatctcca ggcatggcac cccgctccct ccgacggtgg cgagcccctg ccccagtgca 42157 

gcggagaaga tctggcgtat gtcatctata cgtccggctc caccggccag cccaagggcg 42217 

tggccgtgtg ccaccgggcc atgacccgcc tggtgtgcaa cacggactac gttcagctcg 42277 

ggccagagga tcgggtcgcc caggcttcca atgcctcctt tgacgcggcc accttcgagg 42337 

tctggggcgc gctgctcaat ggggcacggc tggtggggct cgccacggag gaggccattc 42397 

aggcgcgccg gctggccgag gtcctgcgcg agcagcggat ctccgtgctc ttcgtcacca 42457 

ccgccctctt caaccacgtg gcccgggagc aaccccaggc cttcagcacc ctgcgctacc 42517 

tgctctttgg cggcgaggca gtggatgcct ccagtgtgag gcgggtgctg aaacagggcg 42 577 

ctccaggaca ccttctacac gtgtacggcc ccacggagaa caccaccttc tccaccgcct 42 637 

ggcgggtgga acacctggcc gagcaggcgc ataccgtgcc catgggtcat cccatcgcca 42 697 

actcgcggct gcacgtgctg gacgaagcgc tgcagccggt gccggtgggc gccatgggcg 42757 

aggtatacct gggaggtgat gggctggcgc tgggctactg gcgccacccc gaggccaccg 42817 

cggagcgctt cgtgccggat ccgcacggcc tggagcccgg aggccggctc taccgcacgg 42877 

7^c?tctggc gcggcggcag gcggatgggg ccgtggtgtt cgctggccgc gtggacaggc 42937 

aggtcaagct gcgaggcttc cgcgtggagc cggcggagat cgagtcgcac ctgtgcgagc 42 997 

actcggaggt gtccgcggcg gtggtggagc tgcggggcga aggggccctg cggcggctgg 43 057 

^cgcctatgt cgtcccccgc gccggtggcc gcccaggtgc cgaggagctg cgcaccttcc 43117 

-eg actgcccgag tacatgetge ccgccagctt ctccctgctg gaggcactgc 43177 

,rrnc caaeggcaag gtggatcgct cggcgctacc cgagagcttc gaggaggect 43237 
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cccgcgagca ggctccggtg gtacctccac gaggccccgt ggaggcgctg ctggtggaca 43297 
tctggaggga ggtgctcggc acccaacggg tatccgtgca cgacgacttc ttcgacctgg 43357 
gcggacactc cctgctggcc acgcgggttg tctcccgctt gcgagaggcg ctgcaggtgg 43417 
agttgccgct gcgcaccctc ttcgaggccc ctcagctctc ggcgctggcg gcccaggtgg 43477 
aggtcctgct cggccaccgg cagctccgcc cccctccgct cgtgcccgct gtacgtccac 43537 
cagagctgcc cctctccttc gcccagcagc ggctgtggtt cctccagcag ctcgcgccac 43597 
aaagcaccgc ctaccagatc ctggatgcgt ggcatgtgcg gggccgggtg gacgtgggcg 43657 
ccctggagcg cgccctggag cagctcgtgc gccggcatga ggcgctgcgc accactttcg 43717 
agcctggcgg cgacggcgtg cctcggcagc gcatccacgc gccggctccg gtcccgctgc 43777 
ggcaggtgga cctgcgctcc catggcatcg ccgcgcggga ggaggccttg cggtggatgc 43837 
gcgagcaggc gctccggcca ctggagctgg acaaggggcc gctgctgcgc gtgtccctgc 43897 
tgcgcttgga agaggcacag tccctgctct tcctcgagct gcaccatatc gtgggcgatg 43957 
gctggtcgct gagcgtttgg agccgagagc tgtcgcacct gtacgaggcg gcgctccatg 44017 
gcgctgagcc ggggttggcg ccgctgcccg tgcagtacgc ggacttcgcg ctgtggcagc 44077 
gcggctggtt gcaagggccg gtgctgcggg aggagctgac ctggtggcgg gagcggctcg 44137 
cgcgcctggc cccgctgcgc ctgcccgcgg accatgcccg cccggaagtg cagcgcttca 44197 
acggcgcgac gtaccgcttc acgctcccgg gcgtacgggt gcaggcgctg cggcggctcg 44257 
ggcatgagca cggggccaca ctcttcatgg tgttgctcgc cggcttcaat gcgctgctgg 44317 
cgcgctacac cgggcagacg gacatcgcca ttggcgcgcc catcgccaac cgcacacgtg 44377 
gggaggtgga gggcctcatc ggcttcttcg tcaacaccct ggtgctgcgc acccgcctgg 44437 
aaggcaaccc ctccttcctc gagctgttga ggcgggtgag ggagacaacg ctagaggcct 44497 
acgcccacca ggagctgcct ttcgagaggc tcgtcgagga gttgcagccc gagcgccagg 44557 
ccaaccagaa cccgctggtg caggtgctgc tcgccctgca gaacgcgccc cgcgagccgt 44617 
tgcggctggc gggcctggag gccgagcacc tggagtacct ggtggccacc acgcgcttcg 44677 
acctcgagct gcacctgtgg gaggaggaag aggggctgag ctgcatcgcc gtgtatgacc 44737 
gcgcgctgta cggcgcgggg acggtggagc ggctggtggg cgcctggtgc acgctgctgg 44797 
agggcgtggc cgagctcccg gcgcgccgcg tggcggagct gcccctggtg cctgcccggg 44857 
agctgcgaca ggttccgccg ccctcccccg ccgcggcgat cgaaacaagc atcggcgccc 44917 
gcttctcgga ggtggcgcgg cggcagccag gcgccacggc cgtcacccaa ggcgggcggc 44977 
acctcaccta tgcagagctg gaggagcgct cggaacgcct ggcccgctac ctggcctggc 45037 
tgggcgtgcg cgcgggcgac agagtggggt tggccaccga gcgcacgctg gagcggatca 45097 
tcagcctgct aggcatcctc aaggcgggag ccgcctacgt accgctggat gtgcgccagc 45157 
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cagcgcgtcg gctgagcctg ctggtgcagg ccgcgggcgt gcgcaccgtc atcgccgagg 45217 
agcaggctcg cacggtgctg tcaggcctgg ggcagccact gaccctggtc gatgccgccc 45277 
aggagccggc ctccgcgcag caggtccccg cgctcgggcc ggagcgctcg ctgggcgggg 45337 
acatgctcgc ctacgtgctt ttcacctcgg gctccaccgg cgagcccaag ggcgtctgca 45397 
ttccccaccg tgccgtgctg cggctgatcc atgagccctc ctacgtccag ctctcgccgc 45457 
gggaggtcat gctgcactac gcaccgctgg agttcgatgc ctccaccttc gaggtctggg 45517 
gggcgctgct caacggggca aggctcgtgc tggtccctcc ggaacagcag tcgctcgaga 45577 
gcctgggaca ggagctgagc acccagggcg tcaccgtgct gtggctcacc gccgggctct 45637 
tccggctcat ggtggaggag cagctcaaga gcctgcgcgg cgtgcgccag ctgctcgccg 45697 
gaggggacgt gctgcccatg ccgcaggtac gccggctgcg cgaggccctc cccgagtgcc 45757 
agctcatcaa cggctacggc cccaccgaga gctgcacctt cacctgctgt caccgcgtgg 45817 
gcagccccca ggagctgggc ggctccgtgc ccatcggcac ccccatcgac ctgggctggg 45877 
tgagcgtggt ggacgagcgg ctccagcccg tgccggacgg agccccgggc gagctgctcg 45937 
tgggcggccc cgggctggct tgggggtacc tgcaacaccc ggagctcacc gccgagcgct 45997 
tcatccccga tccgctcagc cggacgccgg gggctcgcgt ctatcggacg ggagacctgg 46057 
tgcggcgcag ggaggatggg acgctggagt tcctgggccg cgtggaccac cagctcaagg 46117 
tacgcggctt ccgcatcgag ccgggcgagg tggaggccgc ggtgctcacc cacccggcgg 46177 
tgcagtccgc ggtggtggtg gggcgcgagg gccccggcgg caaggagctg gtctgctacg 46237 
ccgtgccgcg ggtggagagc tcggagcagg gctcgcagca ggagcagcgg ctggtacacg 46297 
agtgggagtc cgtgttcgac gggcacatgt accgcgaagc ccccgtgggc ggagagccga 46357 
ccttcaacat cgtcggctgg aagagcagct acaccggcca acccgtggcc gtggaggaga 46417 
tgagggactg gctgcgccac cgggtggagc gcgtgcgcgg gctcaggcca cgacggatcc 46477 
tcgaagtggg ctgtggtacg gggctgatgc tctttgccct gctgccgcac tgcgagcgct 46537 
acgtgggcac ggatttctcc ccggcggcgt tggactacgt gcgtcgctac ctccccccgg 46597 
agcacccggg ccgcgtggag ctgctacacc gcacggcgga cgagtggagc ggcgtggcgg 46657 
ccggctcctt cgacgcggtg ctgctcaatt cggtggtgca gtacttcccc tcgcaggagt 46717 
acctgcggca ggtgctggcc cgctgcgtgg aggccgtgga ggacgggggc ttcgtcttcg 46777 
tgggggatgt gcggagcttg ccgctgctgg agtccttcca cgcctcggtg gagctggagc 46837 
23Z--2<=zzc ctccatgccg ctggaggcgt ggcgcgagcg ggtgcggcgt gcggtgctgg 46897 
aggacaacga gctggtggtg gacccggcac tcttcgtggc gctggcccat cagcatccac 46957 
gggtgagcca cgtggatatc gagctgacgc gcggcacgca cccgaatgag atggcacgct 47017 
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tccgctacaa cgccgtgctc cacatcgggc cgcgcacgcc gccaccggcc tccgaggtgc 47077 
cgtgggtgga ctggtccacg caaggactga gcctcgatgc actgcgggct cggctccggc 47137 
agggcccgcc agggccactg ggggtggcag gcattcccaa tgcgcgagtg ctgcccgcgg 47197 
tccgggccgc cgaggcgctg ggcagcacag gcagcgcgcg gcgggtggag gagctgcgcc 47257 
ggcgcctgtc ccaacccggc acaggggccc aggatccaga ccccttctgg caactggccg 47317 
agtcgctcgg ctacacggcg gcagtgagct ggtctcctgg gcggagggac ggggccttcg 47377 
acgtgctctt cctcccggcg acgccaggga tgcacccgcg gtggctcggc cccacgccgc 47437 
tcaaccgcac gccgccccca gcctccgcgc gtctgtcctc ggagcctcgg cgggccagcc 47497 
tctcactgag gctgggctcc gcgctgcggg cccacctgca gacccatcta ccagacttca 47557 
tggtgccctc acgcttcgtg gtgctgcagt ccctgccgct cactcccaat ggcaaggtgg 47617 
accgcgcggc cttgcccgtc cccgactcgc gtcggctgga atctgccccg ctggtgcccc 47677 
ccagcaacga gctggagcgg gtgctggcgc aggtgtggaa ggaagtgctg ggactggagg 47737 
aagtgagccg ggaggacaac ttcttcgacg tgggagggca ctcgctgctg ctggcgcagg 47797 
tgtgcagccg gctggaggcc cgactgggcc ggcggctcga actggtgacc ctcttccgct 47857 
actcctccat tgctgccctg gccgagcacc tgcaggcccc ccaggagctg gcagccgccg 47917 
aggcgcaggt gcagcgcatg gccacggagc gcgccttgct ccagcaacag gcagcccaac 47977 

48028 



gccgccgggc cggttcgaag cggggagctc cca atg aca cct agt ggg gat 

Met Thr Pro Ser Gly Asp 
4865 4870 



gaa gcg ctg cag tct tec ate gec ctg gta ggc atg gec ggc cgc 48073 
Glu Ala Leu Gin Ser Ser He Ala Leu Val Gly Met Ala Gly Arg 
4875 4880 4885 

H C 2? ff g ?° ° Cg ga ° gtg gag tcc ttc fc W cgc aat ctg gtg 48118 

Phe Pro Gly Ala Pro Asp Val Glu Ser Phe Trp Arg Asn Leu Val 

489 0 4895 ~ 4900 

gec gga gtg gag tcc att tcc ttc ttc tcc gag gaa gag ctg aga 481 ST 

Ala Gly Val Glu Ser He Ser Phe Phe Ser Glu Glu Glu Leu !?g 
4905 4910 4915 

cag gcg ggc gtg tcc gag cag ata cgt egg cgc ccc gag tac gtg 48 208 
Gin Ala Gly Val Ser Glu Gin He Arg Arg Arg Pro Glu Tyr ?a? 

4920 4925 4930 

ccc gec aag ggt gtg ctg gaa gac ctg gag ctg ttc gac gcg ggc 48253 
Pro Ala Lys Gly Val Leu Glu Asp Leu Glu Leu Phe Asp Ala Gly 
4935 4940 4945 

pJ« ni C I 30 o Cg CCg ° gC gag gCSr agC Cac ct & <* at cc 9 ca ST 48298 
Phe Phe Gly Tyr Ser Pro Arg Glu Ala Ser His Leu Asp Pro Gin 

4950 4955 4960 

caa cgc ctg ctg ctg gag tgc teg tgg gag gcg ctg ga g gat gcg 48343 
Gin Arg Leu Leu Leu Glu Cys Ser Trp Glu Ala Leu Glu Asp Ala 
4965 4970 4 975 
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riZ r CCg gat Cag Ctc cca ggc fc W gtg ggc gtc tac gtg 

Gly Leu Arg Pro Asp Gin Leu Pro Gly Trp Val Gly Val Tyr Val 
49 80 4985 49 c, 0 

nil ??° Hf* ? a ° ag ° tac cgc ttc cag ct ^ ct S c ^ c ZQ* cac 

Gly Ala Gly Asp Thr Ser Tyr Arg Phe Gin Leu Leu Arg Gly His 

4995 5000 5005 

rit Z Cg T CtC ag ° ggC tCC aag gat gtg ^cg ggc ttc ttt ggc 

Gly Asp Pro Leu Ser Gly Ser Lys Asp Val Ala Gly Phe Phe Gly 
5010 5015 5020 

a™ Sf C S Cg tfct CtC gCC aCg Cgc gtg ^cc tac ^g ctc aat 

Asn Tyr Pro Asp Phe Leu Ala Thr Arg Val Ala Tyr Lys Leu Asn 

5025 5030 * 5035 

ctg cgc ggc ccc gcg ctc ggc ate cac acc gcg tgc tec acc teg 
Leu Arg Gly Pro Ala Leu Gly He His Thr Ala Cys Ser Thr Se? 

5040 5045 5050 

t1?, o CC ^ C aa ° atg gCg tg ° agc gcg ct S c ^ c TOC ttc gag 

Leu Val Ser He Asn Met Ala Cys Ser Ala Leu Arg Gly Phe Glu 

5055 5060 5065 

? aC at ? g ? g CtC gCg gg ° ggc gtc tc ^ ct S C Q9 ctg ccc gcg 
Cys Asp Met Ala Leu Ala Gly Gly Val Ser Leu Arg Leu Pro 111 

5070 5075 5080 

cgc tec ggc tac ctc tac gag gag ggg gga gtc gec teg aag gac 
Arg Ser Gly Tyr Leu Tyr Glu Glu Gly Gly Val Sia Ser LyJ Asp 
5085 5090 5095 



ggg cac tgc cgc ccc 
Gly His Cys Arg Pro 
5100 

ggg gat ggc gtg ggc 
Gly Asp Gly Val Gly 
5115 

ctg aag gee agg gat 
Leu Lys Ala Arg Asp 
5130 



ttc gac gcg cgt get acc ggc acc gtc acg 
Phe Asp Ala Arg Ala Thr Gly Thr Val Thr 
5105 5110 

gtg gtg gtg ctc aag cgc ctg gag gac gcg 
Val Val Val Leu Lys Arg Leu Glu Asp Ala 
51 20 5125 



ccc ate cac gcg gtg 
Pro He His Ala Val 
5135 



ate cgc ggc tgg gcg 
He Arg Gly Trp Ala 
5140 



etc aac aac gac ggc gee tec egg get ggc ttc acc gcg ccc agc 
Leu Asn Asn Asp Gly Ala Ser Arg Ala Gly Phe Thr Ala Pro Ser 
5145 5150 5155 



gtg gag ggc caa tec gag gtg ate gee ctg 
Val Glu Gly Gin Ser Glu Val He Ala Leu 
5160 5165 



gcg cac gee gcg gcg 
Ala His Ala Ala Ala 
5170 



?K t cgc gac atc acc tac gtg gag gct cac ^ c acc 

Gly He Ser Ala Arg Asp He Thr Tyr Val Glu Ala His Gly Thr 
5175 5180 5185 

rit S£ p^o r tg 2?* gaC CCg atC gaa gtg gcg gc ^ ctc * cc c ^c 

Gly Thr Pro Leu Gly Asp Pro He Glu Val Ala Ala Leu Thr Arg 

5190 5195 5200 

??- ?? g ° at aCC gCt gaC a ° C gcc ttc acc ctc ggt 

Ala Phe Arg Ala His Thr Ala Asp Thr Ala Phe Cys Thr Leu Gly 



48388 



48433 



48478 



48523 



48568 



48613 



48658 



48703 



48748 



48793 



48838 



48883 



48928 



48973 



49018 



49063 
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5215 



gcg gtc aaa tec aac ate gga cac ctg gat 
Ala Val Lys Ser Asn He Gly His Leu Asp 
5220 5225 



gcg gcg gca gga gtg 
Ala Ala Ala Gly Val 
5230 



gec ggg gtg ate aag acg gtg cag gec ctg cgc cac egg etc ate 
Ala Gly Val He Lys Thr Val Gin Ala Leu Arg His Arg Leu He 
5235 5240 5245 

cct ccc acg etc cac ttc gag egg ccc aat ccc gcg etc cac ctg 
Pro Pro Thr Leu His Phe Glu Arg Pro Asn Pro Ala Leu His Leu 
5250 5255 5260 



gag cag age ccc ttc 
Glu Gin Ser Pro Phe 
5265 



ttc gtc aac ace cag ccg etc ccg tgg gag 
Phe Val Asn Thr Gin Pro Leu Pro Trp Glu 
5270 5275 



age ccg cgc ggg cct egg ctg gcg ggg gtg age tec ttt ggc att 
Ser Pro Arg Gly Pro Arg Leu Ala Gly Val Ser Ser Phe Gly He 
5280 5285 5290 

ggc ggc ace aac gee cac acg etc ttc gaa gag get ccg ccg ccg 
Gly Gly Thr Asn Ala His Thr Leu Phe Glu Glu Ala Pro Pro Pro 
5295 5300 5305 

ccc gee age ggc ccc ace cgt ccc aac cag gtg ctg ctg etc teg 
Pro Ala Ser Gly Pro Thr Arg Pro Asn Gin Val Leu Leu Leu Ser 
5310 5315 5320 

gee cga age acc age gcg etc gag cac ate gca gga egg etc gcg 
Ala Arg Ser Thr Ser Ala Leu Glu His He Ala Gly Arg Leu Ala 
5325 5330 ~ 5335 



gca cac ctg cgc cgc cac ccg gac ctg gaa 
Ala His Leu Arg Arg His Pro Asp Leu Glu 
5340 5345 



ctg gcg gac gtg gee 
Leu Ala Asp Val Ala 
5350 



ttt acc etc cag gtg 
Phe Thr Leu Gin Val 
5355 



ggg cga gee cgc ttc ccc tac egg cgc gee 
Gly Arg Ala Arg Phe Pro Tyr Arg Arg Ala 
5360 5365 



etc acc tgt cgc acc ctg gee gag gee atg 
Leu Thr Cys Arg Thr Leu Ala Glu Ala Met 
5370 5375 



gag cgc ctg gag get 
Glu Arg Leu Glu Ala 
5380 



ccc gag ccg egg cca 
Pro Glu Pro Arg Pro 
5385 



ccc gag ccg ctt get cac gag gga gag egg 
Pro Glu Pro Leu Ala His Glu Gly Glu Arg 
5390 " 5395 



ccc cct etc gtc atg etc ttt cca ggg cag ggc act ccg ctg gtc 
Pro Pro Leu Val Met Leu Phe Pro Gly Gin Gly Thr Pro Leu Val 
5400 5405 5410 

ggc acg gee cgt gcg etc cat gaa tec gag ccc acc ttc cgc cag 
Gly Thr Ala Arg Ala Leu His Glu Ser Glu Pro Thr Phe Arg Gin 
5415 5420 54 25 



gcg gtg gag cag tgt gee egg etc ctg agg 
Ala Val Glu Gin Cys Ala Arg Leu Leu Arg 
5430 5435 



cag acg ctg ggg ctg 
Gin Thr Leu Gly Leu 
5440 



gat gtc agg gag gtg etc ttc ccc tec gcg gag cag gag gag cag 



49108 



49153 



49198 



49243 



49288 
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49603 
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Asp val Arg Glu Val Leu Phe Pro Ser Ala Glu Gin Glu Glu Gin 
5445 5450 5455 

gca cgc egg ctg g C g gcg cag acg egg gtg gec cag ccc gcg etc 
Ala Arg Arg Leu Ala Ala Gin Thr Arg Val Ala Gin Pro Ala Leu 
5460 5465 5470 



ttc ace ctg gag tat 
Phe Thr Leu Glu Tyr 
5475 



gee ctg gcg cag ace tgg ctg ggc tgg ggg 
Ala Leu Ala Gin Thr Trp Leu Gly Trp Gly 
5480 5485 



t»» o° g Sf 9 g ° g Ctg g ° a ggC Cat agc °tg 9 gc «? a 9 Ctg gtg 

Leu Gin Pro Gin Ala Leu Ala Gly His Ser Leu Gly Glu Leu Val 

5490 5495 5500 

gcg gcg tgc etc gcg ggt gtc ttc tec ctg gaa gat gca ctt cag 
Ala Ala Cys Leu Ala Gly Val Phe Ser Leu Glu Asp Ala Leu Gin 
5505 5510 5515 

t!?, S? ?? a ° gg gga ° ag Ctc atg cag ST^a tgc cca ccg ggg 

Leu Val Ala Ala Arg Gly Gin Leu Met Gin Gly Cys Pro Pro Gly 
5520 5525 553 0 

??• M tff T ° tg ?? C gt ? CCg Ctg CCg gag gcc g ag °tg S c g ctg 
Ala Met Leu Ala Val Pro Leu Pro Glu Ala Glu Leu Ala Ala Leu 
5535 554O 5545 

ctg ggc agc gag ctg tgt ate gcc gcg gtg aac ggg cca egg gcc 
Leu Gly Ser Glu Leu Cys He Ala Ala Val Asn Sly Pro A?g lit 
5550 5555 5 560 

rlt {£? ff C o Cg S gt CCt Cta CCg gcg gta W 9cg ctc ace gcc 
Cys Val Ala Ser Gly Pro Leu Pro Ala Val Glu Ala Leu Thr Ala 
5565 5570 55?5 

gcg ctg gag agc egg ggc gtg tec agc egg cgc ctg gag act tec 
A.a Leu Glu Ser Arg Gly Val Ser Ser Arg Arg Leu Glu Thr Ser 
5580 5585 559Q 

Ss IS ttt ^ t Ct f? C tCC 3tg gag gCa tgC Cag ggg cca ctc 
His Ala Phe His Ser Ala Ser Met Glu Ala Cys Gin Gly Pro Leu 

5595 5600 5605 

^° a u g Ctg Ctg ° gg Cgc atg cgc ctg cag gcc cct cgc ctc ccc 
Thr Thr Leu Leu Arg Arg Met Arg Leu Gin Ala Pro Arg Leu Pro 

5610 5615 " 5620 



tgt gtc tec ggc ctc 
Cys Val Ser Gly Leu 
5625 



ace ggc egg tgg ctc ace ggc gag gag gcc 
Thr Gly Arg Trp Leu Thr Gly Glu Glu Ala 
5630 5635 



ac* gag cct acg tac tgg gcc cgc cag ctc cgc gag ccg gta cgt 
1111 pro Thr ^ Trp Ala Arg Gin Leu Arg Glu Pro Val Arg 

5640 5645 56 50 

ttc tec gag gcc ctc ga g acg ctc tgg agc ctc aag gag cca gtg 
Phe Ser Glu Ala Leu Glu Thr Leu Trp Ser Leu Lys Ilu Pro Val 
5655 5660 5555 

V ' I " 17 ?7 g gta ggc c cc ggc acc acg ctc acc gcg ctt gcg egg 
j-eu i,eu Sau Val Gly Pro Gly Thr Thr Leu Thr Ala Leu Ala Arg 
5670 5675 5680 
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2£ u ^ C Cg ° CCC gCC Cg ° aCC Cag 9*3 ^tg S C <J age ctt 

Arg HlS Pro Thr Arg Pro Ala Arg Thr Gin Glu Val Ala Ser Leu 

5685 5690 5695 

p™ ,7? ° Cg ga ° aCg gCC gtg ccg att <?cc gtg 

Pro Val Gin Pro Asp Thr Ala Val Pro Cys lie Glu Glu Ala Val 

5700 5705 5710 

ggt gag ctg tgg cag gec ggc ctg gag ctg gac tgg age gec eta 
Gly Glu Leu Trp Gin Ala Gly Leu Glu LeS Asp tS sir 111 LeS 
5715 5720 5725 

Hit lit ff* ^ C CaC CgC gCa CaC Ctt cca ccc tac ccc ttc 

His Ala Ala Pro Arg His Arg Ala His Leu Pro Pro Tyr Pro Phe 
5730 5735 574Q 



gag cgc cag cgc tac 
Glu Arg Gin Arg Tyr 
5745 



tgg att gaa ccg gag 
Trp He Glu Pro Glu 
5750 



get gee ccg cag ccg 
Ala Ala Pro Gin Pro 
5755 



cgc gcg cag cag ccg acg ccc gcg tec ctt gtg ccc ccg gag cag 
Arg Ala Gin Gin Pro Thr Pro Ala Ser Leu Val Pro Pro SS Gin 
5760 5765 5770 



ccc teg cgc gag gec 
Pro Ser Arg Glu Ala 
5775 

gag cag gca cct gec 
Glu Gin Ala Pro Ala 
5790 



ctg gaa gac tgg ttc tac gtg ccc ace tgg 
Leu Glu Asp Trp Phe Tyr Val Pro Thr Trp 
5780 5785 

acg age ggg ggg gga cag ccc ctg gcg ggc 
Thr Ser Gly Gly Gly Gin Pro Leu Ala Gly 
5795 saoo 



p™ S3 t ^ C atg gaC tCC tcg ggc cfc g g cc gag cag gtg 

Pro Val Leu Ala Phe Met Asp Ser Ser Gly Leu Ala Glu Gin Vat 
5805 5810 5815 

Leu 111 a? 0 r Ctg CCC gCg gaC tCC ggC gcg ctc ctc acc cgc 

Leu Ala Ala Leu Trp Pro Ala Asp Ser Gly Ala Leu Leu Thr Arg 

5820 5825 5830 

£S l° 9 ^ C g ? g CaC tat gag Caa ctc a gt g*g cac gee ttc 

Val Glu Pro Ala Gly His Tyr Glu Gin Leu Ser Glu His Ala Phe 
5835 5840 5845 

*™ r ctc * gc S cc g f g agc gag gag gac fc gg g ac g c g ^ c ttc cag 

Arg Leu Arg Pro Glu Ser Glu Glu Asp Trp Asp Ala Leu Phe X! 

5850 5855 5860 

gcg etc cag tec cag g gc egg ctc ccc cgt cgc ate etc cat gee 
Ala Leu Gin Ser Gin Gly Arg Leu Pro Arg Arg He Leu His Sa 
5865 5870 5875 

tgg get ctc acc gcg gag ccc ggc ccc tgt ace ccg gac ggg gaa 
Trp Ala Leu Thr Ala Glu Pro Gly Pro Cys Thr Pro Lp lly £J 
5880 5885 5 89 0 



gcg gtg ctg gaa cag 
Ala Val Leu Glu Gin 
5895 



ggc ttc ttc agc ctg ctg egg ctg gee cgt 
Gly Phe Phe Ser Leu Leu Arg Leu Ala Arg 
5900 5905 



gcg ctc ggc cgc cac gcg cct gag cgc cct 
Ala Leu Gly Arg His Ala Pro Glu Arg Pro 
5910 5915 



gta cag etc gag gtg 
Val Gin Leu Glu Val 
5920 
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ctg tec age ttc gtc cac gcg gtg ggc ccg egg gag ccg etc gag 
Leu Ser Ser Phe Val His Ala Val Gly Pro Arg Glu Pro Leu Glu 
592 5 5930 5935 

ccg etc aag gee acg ctg ctg ggg gee tgt gcg gta ctt ccc ctg 
Pro Leu Lys Ala Thr Leu Leu Gly Ala Cys Ala Val Leu Pro Leu 
5940 5945 5950 

gag tac cct cac gtg cag tgc cgc ace att gac gtg egg ccc ggg 
Glu Tyr Pro His Val Gin Cys Arg Thr He Asp Val Arg Pro Gly 
5955 5960 5965 

age gag cca egg gaa gtg ctg gtg agg age ctg gec gec gag ctcr 
Ser Glu Pro Arg Glu Val Leu Val Arg Ser Leu Ala Ala Glu LeS 
5970 5975 5980 

gec get ccc atg gga gag age ccg gtg gec tgg cgc gac ggg cag 
Ala Ala Pro Met Gly Glu Ser Pro Val Ala Trp Arg Asp Gly Gin 
5985 5990 5995 

egg tac gtg cgc cgc gec ace egg cag agg etc gag gee teg egg 
Arg Tyr Val Arg Arg Ala Thr Arg Gin Arg Leu Glu Ala Ser Arg 
6000 6005 6010 

ccc ctg cgc age ctt egg gag cga ggc gtg tac ctg gta gca gga 
Pro Leu Arg Ser Leu Arg Glu Arg Gly Val Tyr Leu Val Ala Gly 
6015 6020 6025 

ggg ctg ggt gga ate ggc ctg gtg ctg gcg cga gcg ctg gec cag 
Gly Leu Gly Gly He Gly Leu Val Leu Ala Arg Ala Leu Ala Gin 
6030 6035 6040 

egg gcg cgt get egg ctg gcg eta etc acc cac teg ccc ttc cct 
Arg Ala Arg Ala Arg Leu Ala Leu Leu Thr His Ser Pro Phe Pro 
6045 6050 6055 

ccc cga gag cag tgg gag cag tgg ctg gag gaa gec ccg gcg cac 
Pro Arg Glu Gin Trp Glu Gin Trp Leu Glu Glu Ala PrS IS His 
6060 6065 6070 

ccg gag ccg gcg tgg egg age gag gec gac ccc teg gag cgc cgc 
Pro Glu Pro Ala Trp Arg Ser Glu Ala Asp Pro Ser Glu Arg Arg 
6075 6080 6085 

^ g ^ of g C ? C CgC atC Cgc tgc ctg ct S ^g ctg gag cag etc 
Arg Thr Gin Hxs Arg He Arg Cys Leu Leu Glu Leu Glu Gin Leu 

6090 6095 6100 

ggc gec gag gtg cag gtg tac acg gcg gac gtg gcg gag gag gee 
Gly Ala Glu Val Gin Val Tyr Thr Ala Asp ?al Ala SlS Glu HI 
6105 6110 6115 



gec gtg cgc teg gtg 
Ala Val Arg Ser Val 
6120 



gtg gag cag gtg cac gec cgc tgg ggg aag 
Val Glu Gin Val His Ala Arg Trp Gly Lys 
6125 6130 



51223 



51268 



51313 



51358 



51403 



51448 



51493 



51538 



51583 



51628 



51673 



51718 



51763 



51808 



= esc ggc gtg ctg 
He His Gly Val Leu 



cac gec gec gec acg ttc gac gac ggc gtc 
His Ala Ala Ala Thr Phe Asp Asp Gly Val 
6135 6140 6145 



51853 



etc cgc acg cac gag cag tec teg egg gee ctg cgc acc 
- eu Thr His Glu Gin Ser Ser Arg Ala Leu Arg Thr 



51898 
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6150 

aag gtg egg ggc age 
Lys Val Arg Gly Ser 
6165 
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6160 



atg gtg ctg cac gag gtg ctg gcg age gag 
Met Val Leu His Glu Val Leu Ala Ser Glu 
6170 6175 



ggg ttg gat tgg ttc gee ctg tgc tec teg ctg gcg teg gcg ctg 
Gly Leu Asp Trp Phe Ala Leu Cys Ser Ser Leu Ala Ser Ala Leu 
6180 6185 6190 

ggc tea ttc ggc cag gcg gac tac tgc gcg gee aat gec ttc cag 
Gly Ser Phe Gly Gin Ala Asp Tyr Cys Ala Ala Asn Ala Phe Gin 
6195 6200 6205 



gat gcg tat gcg cac cac ctg cgc egg cag 
Asp Ala Tyr Ala His His Leu Arg Arg Gin 
6210 6215 



ggc ttc acg gga gcg 
Gly Phe Thr Gly Ala 
6220 



?2 lit r g T C * OT ggC aCg tgg aga gat acg ggg *cg gec atg 
Leu Ala Leu Asp Trp Gly Thr Trp Arg Asp Thr Gly Ala Ala Met 

6225 6230 6235 

egg ctg gtg gca cgc acc cga egg ggg ggc cat gag aag ccg ccc 
Arg Leu Val Ala Arg Thr Arg Arg Gly Gly His Glu Lys Pro Pro 
6240 6245 6250 

acg ccg etc acc cac cca etc ttc gat tgc gaa cag cgc gag ccg 
Thr Pro Leu Thr His Pro Leu Phe Asp Cys Glu Gin Arg Glu Pro 
6255 6260 6265 



ggc ggg acg cac tgg etc ggc etc acg ctg 
Gly Gly Thr His Trp Leu Gly Leu Thr Leu 
6270 6275 



agg ggt gga gag gac 
Arg Gly Gly Glu Asp 
6280 



tgg gtg gtg gac gag cac egg ctg caa ggg gtg cca aca ctg ccc 
Trp Val Val Asp Glu His Arg Leu Gin Gly Val Pro Thr Leu Pro 
6285 6290 6295 

nf* ta ° ° tg gag ° tg gCg cgg gcg ^cg tgt gee cag gcg 

Gly Val Ala Tyr Leu Glu Leu Ala Arg Ala Ala Cys Ala Gin Ala 
6300 6305 6310 

ctg ggg gec gag gcg gtg gag ctg gcg gag ctg ttg ctg ctg gag 
Leu Gly Ala Glu Ala Val Glu Leu Ala Glu Leu Leu Leu Leu Glu 
6315 6320 6325 



cct ctg acg gta cct 
Pro Leu Thr Val Pro 
6330 



cga ggc gaa tec agg 
Arg Gly Glu Ser Arg 
6335 



cag gtc egg gtg gtg 
Gin Val Arg Val Val 
6340 



etc cag ccc gag ggg cag get cat gec ctg egg gtg gag age egg 
Leu Gin Pro Glu Gly Gin Ala His Ala Leu Arg Val Glu Ser Arg 
6345 6350 6355 



teg gag gaa gcg egg 
Ser Glu Glu Ala Arg 



gga tgg aat gag cat gcg egg ggc egg gtg 
Gly Trp Asn Glu His Ala Arg Gly Arg Val 
6360 6365 ~ 6370 



cgc gee gtg cct egg ctg gee gag cgc ate cag ccc gag ctg ctg 
Arg Ala Val Pro Arg Leu Ala Glu Arg lie Gin Pro Glu Leu Leu 
6375 6380 6385 
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cgc gee gee tgc gaa cac gag cag ccc gtg ccc gga gag ccc cag 



52618 



WO 2004/022586 



49 



PCT/EP2003/009780 



Arg Ala Ala Cys Glu His Glu Gin Pro Val Pro Gly Glu Pro Gin 
6390 6395 6400 

n,?, ^ ° Cfc gtC CaC gcc gga gca c ^ c tgg cat ggc etc ttc 

Glu Gin Gly Pro Val His Ala Gly Ala Arg Trp His Gly Leu Phe 
6405 6410 6415 



cag tgg gtt cgc cgc ggc cct cgc cag gcc 
Gin Trp Val Arg Arg Gly Pro Arg Gin Ala 
6420 6425 



etc gcc cag etc gcg 
Leu Ala Gin Leu Ala 
6430 



rlf ^ g CCC ttC CaC ggt gac ctc ga <* cgc ttc gag etc cac 

Leu Pro Glu Pro Phe His Gly Asp Leu Glu Arg Phe Glu Leu His 

6435 6440 6445 

?? g f tc atg gac atg gcc acc agc ttc gcc att ccc esc sga 

Pro Ala Leu Met Asp Met Ala Thr Ser Phe Ala He Pro Gly Gly 
6450 6455 6460 

o CC ^ T Ct ° gCC fctt ggC tat gag cgc gt 3 ctc a tt cac ggt 
Val Pro Trp Leu Ala Phe Gly Tyr Glu Arg Val Leu He His Gly 

6465 6470 6475 

ccg ctg cca ccg cag gtg ctc agc cac gtg agc ctg ccc gag gag 
Pro Leu Pro Pro Gin Val Leu Ser His Val Ser Leu Pro Glu Glu 
6480 6485 6490 

teg cag get ggc gcg caa caa ctc egg eta cag gtg egg eta eta 
Ser Gin Ala Gly Ala Gin Gin Leu Arg Leu Gin Val A?g LeS LeS 
6495 6500 ' 6505 



gat ctg gag ggc tgg gag egg gtg egg ate 
Asp Leu Glu Gly Trp Glu Arg Val Arg He 
6510 6515 



gat ggg tac ctg ctg 
Asp Gly Tyr Leu Leu 
6520 



egg ccg ctg aag ccc agc gac gcc agt gtc gag ccg gcg gcg ccc 
Arg Pro Leu Lys Pro Ser Asp Ala Ser Val Glu Pro Ala Ala Pro 
6525 6530 6535 

aac gtg gaa gtc gcc gtg ggg acg ccg ggg ctg ctg gag agc ctt 
Asn Val Glu Val Ala Val Gly Thr Pro Gly Leu Leu Glu Ser Leu 
6540 6545 6550 

ggc ctg cgc cgc tgc acc cgc ccc gcc ccc ggg ccg cgc cag gtg 
Gly Leu Arg Arg Cys Thr Arg Pro Ala Pro Gly Pro Arg Gin Val 
6555 6560 6565 

gag ate gag gtg gag gcc get ggg ctc aac ttc ctg gac gtg ctg 
Glu He Glu Val Glu Ala Ala Gly Leu Asn Phe Leu Asp Va? Leu 
6570 6575 6580 

I?5 111 It? r? g atg CCg gCa ttg gag gcg gag gaa gta 

Gly Ala Leu Gly Met Met Pro Ala Leu Glu Ala Glu Glu Ser Val 

6585 6590 6595 

ctg ggg cgc gag tgc tec gga cgc att gcc gcc gtg ggc gag ggc 
Leu Gly Arg Glu Cys Ser Gly Arg He Ala Ala Val Gly Glu Gly 
66 00 6605 6610 
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ggc tgc ttc cgc tec tac gtg ctg gtg gat gag age cag gtg gtg 
Gly Cys Phe Arg Ser Tyr Val Leu Val Asp Glu Ser Gin Val Val 
6630 6635 6640 



cgc agg ccc gec teg 
Arg Arg Pro Ala Ser 



ctg ggg etc gee gag ggg gcg gee cag atg 
Leu Gly Leu Ala Glu Gly Ala Ala Gin Met 
6645 6650 6655 



gtg ccg ttc gee acg gcg tac ttc gee ctg cac ace gtg ggc egg 
Val Pro Phe Ala Thr Ala Tyr Phe Ala Leu His Thr Val Gly Arg 
66 60 6665 6670 

ttg egg cgc ggc gag cgc ate etc ate cac gee gcg gee gga ggg 
Leu Arg Arg Gly Glu Arg lie Leu He His Ala Ala Ala Gly Gly 
6675 6680 6685 

ctg ggt ctg gee gee gtt cag etc gca tec egg ace ggg gcg gag 
Leu Gly Leu Ala Ala Val Gin Leu Ala Ser Arg Thr Gly Ala Glu 
6690 6695 ~ 6700 

ata ttg gee ace gcg ggc agt gag cag aag cgc gag tac ctg cgc 
He Leu Ala Thr Ala Gly Ser Glu Gin Lys Arg Glu Tyr Leu Arg 
6705 6710 6715 

teg etc ggc att get cac gtg ctg gac teg cgc age acc tec ttc 
Ser Leu Gly He Ala His Val Leu Asp Ser Arg Ser Thr Ser Phe 
6720 6725 6730 



gtc age gaa gtg cgc 
Val Ser Glu Val Arg 
6735 



gag cgc acc ggc ggg cgt ggg gtg gat gtg 
Glu Arg Thr Gly Gly Arg Gly Val Asp Val 
6740 6745 



gtg etc aac teg ctg gcg gga gag ctg etc 

Val Leu Asn Ser Leu Ala Gly Glu Leu Leu 
6750 6755 

gtg ctg gee ccg cac ggc cgc ttc ctg gag 

Val Leu Ala Pro His Gly Arg Phe Leu Glu 
6765 6770 



ctg gcg ggc ctg tec 
Leu Ala Gly Leu Ser 
6760 

ctg ggg aag agg gac 
Leu Gly Lys Arg Asp 
6775 



ctg tat gcg gac cag cag gtg ggc etc cga acc ctg gee cga ggg 
Leu Tyr Ala Asp Gin Gin Val Gly Leu Arg Thr Leu Ala Arg Gly 
6780 6785 6790 

cag act ttc gee gee att gac ttc ggt ccc cac cac ccg gac ttc 
Gin Thr Phe Ala Ala He Asp Phe Gly Pro His His Pro Asp Phe 
6795 6800 6805 

cga gcg gtg etc gag gag gtg gee acg caa etc acc cag ggc cag 
Arg Ala Val Leu Glu Glu Val Ala Thr Gin Leu Thr Gin Gly Gin 
6810 6815 6820 

etc gag cca ttg ccc acc cgc etc ttc ccc gcg egg cag gtg gee 
Leu Glu Pro Leu Pro Thr Arg Leu Phe Pro Ala Arg Gin Val Ala 
6825 6830 ~ 6835 

gaa gee ttc age ttc atg gee cgc gcg ctg cac ate ggt cgc gtc 
Glu Ala Phe Ser Phe Met Ala Arg Ala Leu His He Gly Arg Val 
6840 6845 "* 6850 
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egg ggc tec agg ccc gca ccg gtg gec gta cct cct tgg gag gac 

Arg Gly Ser Arg Pro Ala Pro Val Ala Val Pro Pro Trp Glu Asp 
6870 6875 6880 

ccg egg ctg gcg ggc ggc ate tec tct gaa gag ggt gcg gag gee 

Pro Arg Leu Ala Gly Gly lie Ser Ser Glu Glu Gly Ala Glu Ala 
6885 6890 6895 

ttc ctg cgc gcg ttg gag caa ggg gca ccg cag etc ate ate tec 

Phe Leu Arg Ala Leu Glu Gin Gly Ala Pro Gin Leu lie lie Ser 
6900 6905 6910 



ccc cag gac ttc age 
Pro Gin Asp Phe Ser 
6915 



teg ctg ctg cgc ggc ctg ggc ggc age cag 
Ser Leu Leu Arg Gly Leu Gly Gly Ser Gin 
6920 6925 



ggc gtg cgc gaa aag gag cgc etc gtc acc ggg cgc gee gec gec 

Gly Val Arg Glu Lys Glu Arg Leu Val Thr Gly Arg Ala Ala Ala 
6930 6935 6940 

gee gag ccg cag gec ttg cca ccc tec teg ctg gag cag etc ate 

Ala Glu Pro Gin Ala Leu Pro Pro Ser Ser Leu Glu Gin Leu lie 
6945 6950 6955 

gag cag gtg tgg cgc aag cac ctg ggt gtg gag cgc gtg cag ccc 

Glu Gin Val Trp Arg Lys His Leu Gly Val Glu Arg Val Gin Pro 
6960 6965 6970 



acg gac age ttc ttc 
Thr Asp Ser Phe Phe 
6975 



cag ctt gga gga gac teg ctg ctg ggc ate 
Gin Leu Gly Gly Asp Ser Leu Leu Gly He 
6980 6985 



caa gtg gcg gcg gat etc cgc agg cac ctg ggt gtg gag ctg ccc 
G^n Val Ala Ala Asp Leu Arg Arg His Leu Gly Val Glu Leu Pro 
6990 6995 7000 

acg gee acc etc ttc age cac ccc acc etc gee gcg ctg gee gcg 
Thr Ala Thr Leu Phe Ser His Pro Thr Leu Ala Ala Leu Ala Ala 
7005 7010 7015 



get ctg egg get cga cag ggt gag gee gcg 
Ala Leu Arg Ala Arg Gin Gly Glu Ala Ala 
7020 7025 



get ccc act gee ccc 
Ala Pro Thr Ala Pro 
7030 



gcg cca gcg etc gtg ccg gat cca gec gcg cgc ttc gag cct ttc 
Ala Pro Ala Leu Val Pro Asp Pro Ala Ala Arg Phe Glu Pro Phe 
703 5 7040 7045 

ccc etc acg gat gtg cag gaa gec tac tgg gtg ggt cgc cgc teg 
Pro Leu Thr Asp Val Gin Glu Ala Tyr Trp Val Gly Arg Arg Ser 
7050 7055 7060 

gee the gag etc ggc ggc gtc gee gec cat ggc tac ttc gaa ate 
Ala Pne Glu Leu Gly Gly Val Ala Ala His Gly Tyr Phe Glu He 
7065 7070 7075 



Z^c "g ggg ctg 

'3.1'j. Ser Pro Gly Leu 



gag gtg gag cgc ttc ate caa tgc tgg cgc 
Glu Val Glu Arg Phe He Gin Cys Trp Arg 
7080 7 085 " 7090 
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cag etc ctg cag cgc cac gac atg ctg cgc atg gtg gt g etc ccg 
jjxh Gin Arg His Asp Met Leu Arg Met Val Val Leu Pro 
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gat ggg egg cag cag 
Asp Gly Arg Gin Gin 
7110 

gag gtg gtg gag ctg 
Glu val Val Glu Leu 
7125 

cgc egg etc cag ctg 
Arg Arg Leu Gin Leu 
7140 

age gac cgc tgg ccg 
Ser Asp Arg Trp Pro 
7155 

gga ggc gtc cgc ate 
Gly Gly Val Arg He 
7170 

gcg tgg agt tea gcg 
Ala Trp Ser Ser Ala 
7185 

cac gag ccg ggc egg 
His Glu Pro Gly Arg 
7200 

gac tac gtg ctg gcg 
Asp Tyr Val Leu Ala 
7215 

gag cgc gec cgc gca 
Glu Arg Ala Arg Ala 
7230 



52 

7100 



gtg etc gag cag gtg ccg gag 
Val Leu Glu Gin Val Pro Glu 
7115 

egg ggg etc tec ccc cag gag 
Arg Gly Leu Ser Pro Gin Glu 
7130 

cgt gag cgc atg gec cac cag 
Arg Glu Arg Met Ala His Gin 
7145 

etc ttc gag ctg gtg etc tgc 
Leu Phe Glu Leu Val Leu Cys 
7160 

cac atg age atg gat gec ctg 
His Met Ser Met Asp Ala Leu 
7175 

gtg ctt egg cag gac ttc gee 
Val Leu Arg Gin Asp Phe Ala 
7190 

ccg ctg gag ccg ctg gee ate 
Pro Leu Glu Pro Leu Ala He 
7205 

gag cgc egg ctg cgc gag ggc 
Glu Arg Arg Leu Arg Glu Gly 
7220 



PCTYEP2003/009780 

7105 



tac acg 
Tyr Thr 



gec gag 
Ala Glu 



gtc ctg 
Val Leu 



egg tac 
Arg Tyr 



atg ctg 
Met Leu 



cag ctg 
Gin Leu 



ace ttc 
Thr Phe 



gag gee 
Glu Ala 



ccg 
Pro 
7120 

tec 
Ser 
7135 

cgc 
Arg 
7150 

gag 
Glu 
7165 

gat 
Asp 
7180 

tac 
Tyr 
7195 

cgc 
Arg 
7210 

cat 
His 
7225 



tac tgg tgg get egg 
Tyr Trp Trp Ala Arg 
7235 



cca ccg ccc gag ctg 
Pro Pro Pro Glu Leu 
7245 

cac gcg egg ttc ace 
His Ala Arg Phe Thr 
7260 

tgg gee egg etc cag 
Trp Ala Arg Leu Gin 
7275 

teg gec gec tgc atg 
Ser Ala Ala Cys Met 
7290 

age cgt cac ccg cgc 
Ser Arg His Pro Arg 
7305 

ttg ccc ctg cac ccg 
Leu Pro Leu His Pro 
7320 



ccc ctg gtg aag gaa 
Pro Leu Val Lys Glu 
7250 

cac cgc gag get egg 
His Arg Glu Ala Arg 
7265 

gag egg gcg cgc gee 
Glu Arg Ala Arg Ala 
7280 

gee gee ttc gee gag 
Ala Ala Phe Ala Glu 
7295 

ttc ace etc aac etc 
Phe Thr Leu Asn Leu 
7310 

cag gtg gac gag ctg 
Gin Val Asp Glu Leu 
7325 



ctg gac 
Leu Asp 



ccc teg 
Pro Ser 



etc gag 
Leu Glu 



acg ctg 
Thr Leu 



cag ctg 
Gin Leu 



cca cac 
Pro His 



cac ggc 
His Gly 



gtg ctt 
Val Leu 



ace etc 
Thr Leu 



etc ace 
Leu Thr 



gee cgc 
Ala Arg 



ttc cag 
Phe Gin 



gtg ggc 
Val Gly 



gac ttc 
Asp Phe 



ccg 
Pro 
7240 

gag 
Glu 
7255 

cgc 
Arg 
7270 

ccc 
Pro 
7285 

tgg 
Trp 
7300 

cgc 
Arg 
7315 

acc 
Thr 
7330 



tec ctg gtc ctg ctg gag gta gag gca cac gcg gcg age acc ttc 
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Ser Leu Val Leu Leu Glu Val Glu Ala His Ala Ala Ser Thr Phe 
7335 7340 7345 

gcc gag cgt gcc tec egg etc cag gca cag eta tgg egg gac ctg 

Ala Glu Arg Ala Ser Arg Leu Gin Ala Gin Leu Trp Arg Asp Leu 
7350 7355 7360 

gag cac ggc age gtg age gcc gtg cag etc ate cgc gag etc gtc 

Glu His Gly Ser Val Ser Ala Val Gin Leu He Arg Glu Leu Val 
7365 7370 7375 



cgc ace ggc cgc cgc tec ccg ggc gcc ate 
Arg Thr Gly Arg Arg Ser Pro Gly Ala He 
7380 7385 



atg ccc gtc gtc ttc 
Met Pro Val Val Phe 
7390 



acc age ate etc age ctg gat gcg egg cgc ggc ccc cag ggc age 

Thr Ser He Leu Ser Leu Asp Ala Arg Arg Gly Pro Gin Gly Ser 
7395 7400 7405 

etc tec ttc ttc gag gga gaa ctg gtg tac age ate age cag act 

Leu Ser Phe Phe Glu Gly Glu Leu Val Tyr Ser He Ser Gin Thr 
7410 7415 7420 



ccc cag gtg tgg ctg gac cac gga gtc cac 
Pro Gin Val Trp Leu Asp His Gly Val His 
7425 7430 



gag gag gag ggg gcg 
Glu Glu Glu Gly Ala 
7435 



etc gtc ctg gcg tgg gac teg gtg gag gcg etc ttc cct ccg ggc 
Leu Val Leu Ala Trp Asp Ser Val Glu Ala Leu Phe Pro Pro Gly 
7440 7445 7450 



atg gtg gac gac atg 
Met Val Asp Asp Met 
7455 



ttc cac gcc tac cag egg ctg ctg ggg gca 
Phe His Ala Tyr Gin Arg Leu Leu Gly Ala 
7460 ^ 7465 



etc gcc gag gag gag caa gcg tgg gag ggc gag ctg ccg gag ctg 
Leu Ala Glu Glu Glu Gin Ala Trp Glu Gly Glu Leu Pro Glu Leu 
7470 7475 7480 

ctg cct cct gcc cag cgt gag ttg etc gcg cgc tac aac gcc acc 
Leu Pro Pro Ala Gin Arg Glu Leu Leu Ala Arg Tyr Asn Ala Thr 
7485 7490 7495 

cag gcg ccg egg ccc age gga egg ctg gag gag ggc ttc ttc acc 
Gin Ala Pro Arg Pro Ser Gly Arg Leu Glu Glu Gly Phe Phe Thr 
7500 7505 7510 

cag gcg egg etc cac ccc gag etc ccc gcg ctg etc gca ccg gag 
Gin Ala Arg Leu His Pro Glu Leu Pro Ala Leu Leu Ala Pro Glu 
7515 7520 7525 

cgc acc ctg age tac ggc gag ctg gca agg cga gcc cag gcg ctg 
Arg mr Leu Ser Tyr Gly Glu Leu Ala Arg Arg Ala Gin Ala Leu 
7530 7535 7540 

gcc gcg cgc eta cgc gag ctg gag gtg cag cct cag gag ttg gtg 
Ala Ala Arg Leu Arg Glu Leu Glu Val Gin Pro Gin Glu Leu Val 
7545 755O 7555 

gcc act gcc atg cac aag ggc tgg gag cag gcc acc gcc gtg etc 
Ala He Ala Met His Lys Gly Trp Glu Gin Ala Thr Ala Val Leu 
7560 7565 7570 
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ggc gtg ctg cag gcg gcc gcg gcc tac ctg ccg ttg gat cca gag 56173 
Gly Val Leu Gin Ala Ala Ala Ala Tyr Leu Pro Leu Asp Pro Glu 
757 5 7580 " 7585 



cag ccg ccg etc egg etc cac cag etc ctg gag gag ggg ccc gcc 
Gin Pro Pro Leu Arg Leu His Gin Leu Leu Glu Glu Gly Pro Ala 
7590 7595 7600 

cgc gtg gtg etc acc cag tec tec ttg ctg cac acc gtg ccc tgg 
Arg Val Val Leu Thr Gin Ser Ser Leu Leu His Thr Val Pro Trp 
7 605 7610 7615 

ccg ccg ggt gtg cag gtg ate gcc gtg gac gaa etc gag cct gca 
Pro Pro Gly Val Gin Val lie Ala Val Asp Glu Leu Glu Pro Ala 
7620 7 625 7630 

Th^ T° CCg Cta CCg CCt cgc ggt aca CC S gag cac etc 

Tnr Glu Ala Pro Pro Leu Pro Pro Arg Gly Thr Pro Glu His Leu 

7 "5 7640 7645 

gcc tac gtc ate tat acc tec ggc tec acc ggc aaa ccc aag ggt 
Ala Tyr Val lie Tyr Thr Ser Gly Ser Thr Gly Lys Pro Lys Gly 
7650 7655 7660 

S? S* Hf* C3C Cgg gcc gcg ctc aac acc STtg gtg gac etc 

Val Ala He Glu His Arg Ala Ala Leu Asn Thr Val Val Asp Leu 

766 5 7670 7675 



aac acc cgc ttt ggc gtc ggc ccg gag gat 
Asn Thr Arg Phe Gly Val Gly Pro Glu Asp 
7680 7685 



egg gtg ctg ggc ctg 
Arg Val Leu Gly Leu 
7690 



tec gcg ctc acc ttc gac ctg teg gtg tac gac gtg ctg ggg ctg 
Ser Ala Leu Thr Phe Asp Leu Ser Val Tyr Asp Val Leu Gly Leu 
76 95 7700 7705 



ctc ggc gcg ggc gga gcg ctg gtg ctg ccc 
Leu Gly Ala Gly Gly Ala Leu Val Leu Pro 
7710 7715 



gcg gcg gag gcg gag 
Ala Ala Glu Ala Glu 
7720 



aag gat ccc gcc cac 
Lys Asp Pro Ala His 
7725 



tgg tgg gag egg ctg gtg get ggg egg gtc 
Trp Trp Glu Arg Leu Val Ala Gly Arg Val 
7730 7735 



acg gtg tgg aac tec acc ccg gcg ctg atg 
Thr Val Trp Asn Ser Thr Pro Ala Leu Met 
7740 7745 



ctg ctg ctg gtg gag 
Leu Leu Leu Val Glu 
7750 



tac gcc gag cag cgc ggg ctg aag ctg ccc gcg gcg ctg cgc ctg 
Tyr Ala Glu Gin Arg Gly Leu Lys Leu Pro Ala Ala Leu Arg Leu 
7755 7760 7765 

r tC ag ° ggC gaC tgg at ° CCC gtg gcg cta c <=g gat cgc 
Val Met Leu Ser Gly Asp Trp He Pro Val Ala Leu Pro Asp Arg 

777 0 7775 7780 

S« Hi T tC g ? C ^ gSC gtg gtg gtg agC Ctg ^ ^ C 

He Arg Ala Leu Gly Arg Asp Val Gin Val Val Ser Leu Gly Gly 
77 85 7790 7795 

g f° Su C g f g gCt tcc atc tgg tcc att ST CC tac ccc att ggc cag 
Ala Thr Glu Ala Ser He Trp Ser He Ala Tyr Pro He Gly Gin 
7 800 7805 7810 
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gtg gca ccg cag tgg aag age att ccc tat ggc atg ccg ctg gec 

Val Ala Pro Gin Trp Lys Ser lie Pro Tyr Gly Met Pro Leu Ala 
7815 7820 7825 

aat cag cga ttc cac gtg ctg gac ggg egg ctg gag gca egg ccc 

Asn Gin Arg Phe His Val Leu Asp Gly Arg Leu Glu Ala Arg Pro 
7830 7835 7840 

tgg tgg gtg ccg ggc gag ctg tac att ggc ggc gag gcrcr eta aca 

Trp Trp Val Pro Gly Glu Leu Tyr He G?y Sly SlS lly Leu Sa 
7845 7850 7355 

cgc gag tac tgg agg gat gag ccg etc ace gca acg cgc ttc ate 

Arg Glu Tyr Trp Arg Asp Glu Pro Leu Thr Ala Thr Arg Phe He 
7860 7865 7370 



cgt cac ccg cgc aca 
Arg His Pro Arg Thr 
7875 



ggc gag egg ctg tac cgc ace ggg gac caa 
Gly Glu Arg Leu Tyr Arg Thr Gly Asp Gin 
7880 7885 



ggg cgc atg ctg ccc gaa ggg age ate gag 
Gly Arg Met Leu Pro Glu Gly Ser He Glu 
7890 7895 



ttc etc ggc egg gag 
Phe Leu Gly Arg Glu 
7900 



gat ctg cag gtc aag gtg cag ggc ttc cgc gtg gag ctg ggg gag 
Asp Leu Gin Val Lys Val Gin Gly Phe Arg Val Glu Leu Gly Glu 
7505 7910 ?915 



att gaa gcg gcg ctg gee cag cac ccg gcg 
He Glu Ala Ala Leu Ala Gin His Pro Ala 
7920 7925 



eta teg gcg age gtg 
Leu Ser Ala Ser Val 
7930 



gtg gtg gca cga gga 
Val Val Ala Arg Gly 
7935 

tac gcg gtt cct cgc 
Tyr Ala Val Pro Arg 
7950 



gag cct egg gga gtg egg egg ctg gtg gee 
Glu Pro Arg Gly Val Arg Arg Leu Val Ala 
7940 7945 



teg ggt cag acg ccc 
Ser Gly Gin Thr Pro 
7955 



gcg gca ggc gag ctg 
Ala Ala Gly Glu Leu 
7960 



cgc cgc tac ctg gcg gag egg eta ccc gcg tac atg gtg ccc tec 
Arg Arg Tyr Leu Ala Glu Arg Leu Pro Ala Tyr Met Val Pro Ser 
7965 797O 7975 

?? C Si* ^ T Ct ° Ctg gag tC ° Ctg CCC cgc aac c ^ c aac ggg aag 
Ala Phe Val Leu Leu Glu Ser Leu Pro Arg Ser Arg Asn lly Ly s 

7980 7985 7990 

ate gee cga gat cag etc ccc gag ccg cag cag aca cag gga ctg 
He Ala Arg Asp Gin Leu Pro Glu Pro Gin Gin Thr Gin Gly Leu 
7995 8000 8005 

111 A?a SS SK li 9 ??* ^ CtC gtg gag Cgg Ct * ^ca 

Ala Ala Gin Ala Ala Ala Ala Asp Pro Leu Val Glu Arg Leu Ala 

8010 8015 ~ 8020 

gee ctg gtg aaa gag gcg ctg agg etc gag cgc gtg gag ccc caa 
Ala Leu Val Lys Glu Ala Leu Arg Leu Glu Arg Val 3E Pro Gin 
8025 8030 8035 
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8050 



etc ate aac cgc ctg gag gca gag ctg cag ttc cgc ccg cga ctg 57613 
Leu lie Asn Arg Leu Glu Ala Glu Leu Gin Phe Arg Pro Arg LeS 
8055 8060 8065 

Ha !™ rf C Sf° 2? ff CC ° aC ° gtg Cag ggg ctc gcc acg cfc c 57658 

Ala Asp lie Tyr Glu Asn Pro Thr Val Gin Gly Leu Ala Thr Leu 

8070 8075 8080 

cac caa gag aag aca aag age cag gga gag gga ggc get ccg egg 57703 
His Gin Glu Lys Thr Lys Ser Gin Gly Glu Gly Siy Ala Prl a?J 
8085 8090 8095 

rI C I? 1 9 , Q ° CC CgC tCC acg Ctt ctg ccc CTCC ^g gag tgg ggg 57748 
Leu Thr Ala Pro Arg Ser Thr Leu Leu Pro Ala Glu Glu Trp Gl? 

8100 8105 8110 

2S 1*1 ° gC ° Cg ggC Ctg Cgc cgc ttc ccc S at 57793 

Arg Phe Lys Ala Asn Arg Pro Gly Leu Arg Arg Phe Pro Asp Gly 

8115 8120 8125 

ih? p™ ri 3 fr tS Ctg CCa gg ° agC ggC ttg gcg ccg gcc ccc 57838 

Thr Pro Glu Val Ala Leu Pro Gly Ser Gly Leu Ala Pro Ala Pro 

8130 8135 8140 

gag gag ctg aca get ctc gag cgc cgc cgc agt gtg cgc acc tac 57883 
Glu Glu Leu Thr Ala Leu Glu Arg Arg Arg Ser Va? Arg Thr ijr 
8145 8150 8155 

tec etc gag ccc gtg age cat gag cag ttg gga egg ctg ctg gcc 57928 
Ser Leu Glu Pro Val Ser His Glu Gin Leu Gly Arg Leu Leu i£a 
8160 8165 8170 

p™ r tt9 » g ° tgg gag gtg cag ggc tcg cgg c 3t tac ctc tac 57973 

Pro Leu Arg Glu Trp Glu Val Gin Gly Ser Arg Arg Tyr Leu Tyr 

8175 8180 8185 

gcc tec gcg ggc ggg etc tac ccg gtg cag ctc tac ctc cac ctc 5 8018 

Ala Ser Ala Gly Gly Leu Tyr Pro Val Gin Leu Tyr Leu His Leu 
8190 3195 8200 

aag ccc ggc egg gcg cga ggg etc gag ccc ggc acc tgg tat tac 580fiT 
Lys Pro Gly Arg Ala Arg Gly Leu Glu Pro Gly Thr Tr? Tyr Itr 
8205 8210 8 Il5 

gat ccc age acc cat egg ctg gtg ctc ctg tec gcc gga gcc gga 58108 
Asp Pro Ser Thr His Arg Leu Val Leu Leu Ser La Gly Ea gS 
8220 8225 8230 

ctg gat cgc cgc ate cat gat ccg cac cag aac cag gcc ate ttc 581 ^ 

Leu Asp Arg Arg H e His Asp Pro His Gin Asn Gin IS lie Pne 
3235 8240 8245 

As P Se? S 111 p£e 5°° , CtC CtC 'f C gCC CgC atg ggg gcc 58198 

Asp ber Ala Ala Phe Ser Leu Phe Leu lie Ala Arg Met Gly Ala 

8250 8255 8260 

gtg gag ccg gta tac gcc gaa cat gcc ctg cac ttc gcc aca eta , fi ,^ 
Val Glu Pro Val Tyr Ala Glu His Ala Leu His Phe lit £? Leu 
8265 8270 8275 

gag gca ggg ctg atg acc cag ctt ctc gat ctg ggc gcg gct ccc 58288 
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Glu Ala Gly Leu Met Thr Gin Leu Leu Asp Leu Gly Ala Ala Pro 
82 80 8285 8290 

agt ggg ctg ggg ctg tgc cat ate ggc gac ctg gac ttc gec cag 58333 
Ser Gly Leu Gly Leu Cys His lie Gly Asp Leu Asp Phe Ala Gin 
8295 8300 8305 

lit a" 2? ff r tC ^ C Ca ° ttg gag gag gag cac ct 9 cfc 9 cac 58378 
Ala Arg Gly Leu Phe His Leu Glu Glu Glu His Val Leu Leu His 

8310 8315 8320 

2E r! 9 2?* gta Ct ° CCC aCt cgg gg g cag gag gcg gec 58423 
Ser Leu Val Gly Gly Val Leu Pro Thr Arg Gly Gin Glu Ala Ala 
8325 8330 8335 

teg gtg ccc get gag gga gga acg gag gcg egg caa ctg gcg cag 58468 
Ser Val Pro Ala Glu Gly Gly Thr Glu Ala Arg Gin Leu Ala Gin 
8340 8345 8350 

etc ctg caa cag gtg aag acg etc acg ccc gag gee gcg cga gcg 58513 
Leu Leu Gin Gin Val Lys Thr Leu Thr Pro Glu Ala Ala Arg aS 
8355 8 3 60 8365 

ctg etc gag gee cgc cgc ggc age aag ggg aga cca cat gag taa 58558 
Leu Leu Glu Ala Arg Arg Gly Ser Lys Gly Arg Pro His Glu 
8370 8375 

getgeacgag gagctggaga gcctggcgcc ggagcagegg gagctactcg cggccctgat 58618 

gaaagagcag ggcctggacg agggegcatt gctgatgccc gtggagcgca ageccgaggg 58678 

gctgccgctc tcgtccgccc ageagegcat gtggttcctg caacagctca ggectacgag 58738 

ccccttctac aacgtgcacg cggcgctccg getcaeggga aagctgaagg tggagtgcct 58798 

ggtgcacagc ctgaatgagt tcgtccggcg tcacgagccg ctgcgcacgg tgttcccctc 58858 

cgccggaggt caacccctgc agcgcatcct cgcgcccgca ccggctgcgc tegagcageg 58918 

cgacctgagc ggcgtgcccg cccaggagcg agaggecgag gtgtaccgcg cggtggagca 58978 

cgcagcgctc gcctcctttg atctggagcg agagccgccg tgccgcttcc tcctggtgcg 5903 8 

agtggagccg gaggaacacg tgctggtgtt cgccacgcac cacatcgcag cggatggctg 59098 

gtcgctcggc gtcttcgtgc aggagctgtg cgcgctgtac aeggctgetg tacacgegga 59158 

gccaccggcg ctgccaccgc tgeggctgea gtacgeggae ttcgccgcgt gggageggag 59218 

ccggctcaag ggtgggcgcg agegegaget gctcgagtac tggcaggagc agctggcagg 59278 

gctgccggac ctgagcaccc tgccgccaga gcggcccagg cccccgctgt ccaagcagga 59338 

gggcgccacg ttcgaattcg ccctgccacc acaccaggtg caggccctgc gctcacttgc 59398 

teaagegegg cgcacctcgc tcttctccgt gctgctcgcc gccttccaat ggctgctggc 59458 

""ZC^Z^SSs £« cca Srsacg acgtggccct gggcatgccc atcgccaacc gcaaccgcaa 59518 

« tggsa ggcctcatcg gctgcttcgc cagtaccctg gtgctgcgcg cgaagccctc 59578 

•r-7-c-cgaoc gccttcacct catggctcgc ccaggtgtcc gagcagctcc acggcgccct 59638 



WO 2004/022586 PCT/EP2003/009780 

cgagcaccag gaggtgcctt tcgaacgatt ggtggaggtg ctccagcccc ggcgccgcat 59698 
ggatcggcac ccgctcttcc agatcttcct cgccatgcag cagcacccgc tgcggcgcgc 59758 
cgagctgcca ggcctgctgc tctcggagtt cccccttcgc agccgcgtgg cccgcttcga 59818 
tctggagttc cacctctggg agtctgccca gggcgtcgag ggcaccctca tctacgacgt 59878 
ggacctgtat ggccaggagg gcgtagcgca actggtgcgg cgctacgtga gcctgctcga 59938 
agccgtggcg gcggacccga cgcggacact gggggagctg gcgggtgaga cgctggcccc 59998 
gcccgtacgg aagcaggttc tcgccctgtc ggagtgtccg cccctcccac gcccgagcac 60058 
agcccctcgg acactggccg aggcacttct acagaccgcc gagcgcttcc ccacggccac 60118 
ggtcagcttc gtccaggcac agggctcttg cacggcctgg actctgccag agctggtgga 6017 8 
gcgggcccgg cgcctgcagg ccggcttgcg gcagtgggga ttgcggccgg gcgacagtct 60238 
ggtgctggtg ctggggaggg aggaagagac ggtagaggca ctctgggcct gcgtgctggc 60298 
gggagtggcg cccctggtgc taccggcgcc gcccgcgaga gcggaggcga gtcccgcgct 60358 
ctcacgtctg cggcatgccc ggcagctgct gggtgggccc cgggtcctca cccggcagga 60418 
gatgctgccc gatcttgcca ggcagcttca ggtctcgccc accgccgaca tcctgggggc 60478 
cgtggaggag ctgagagcca cgggtggcga ggcgcccctg cccccagggc gcatggatga 60538 
cgtggccctg ctcaacctca cctcgggcac caccggcaag gccaagtgcg ccatgctcac 60598 
gcaccgcaac ctgctggtgc gcctggaggc caccaacgtc gtctacgagt cccagccgct 60658 
cgagcgcggc ctcgtctggt tgcagctgca caacattggc gccctgtccg agtaccacct 60718 
ccgcccgctc tgcgccggca tgcacacctt ccacgctccc accgaggagg tgctcgccga 60778 
gcccctgcga tggctggagt ggctcgagcg gtatggcatt gcacagacgt gggcgcccag 60838 
cttcgcctac agccacctgc tggagcggct gcgcaaggtg gaggaccggc gctggagcct 60898 
gggaggcgtg cgcgtgctgc tcagcgccgg agagcagatc tccgcgccca tggtggagga 60958 
gctgatgcgg aggttggctc cctcgggagt gcgggaggat gccttcgtcg ctgcctgggg 61018 
aatgaccgag acggcttcgg gggtgaccta tgcgcggcgg cccggaacgc cgccccgcat 61078 
gcacaccctg gagcgcgcga gcttgagcgg tccgcttcgg cacgcggcac ccgcttcgcc 61138 
caccgcgctg cggctgatgg atgtaggagc ccccattgcc ggcaccgccc tgcgggtggt 61198 
ggacgcgagc ggcgagctgc tctccgagga gtgcgtgggg cgcattcaag tgcgcggcga 61258 
gatgatctct cccggctact acggggaccc gaaggcctcg gcggcgctgc tcaccgccga 61318 
cggctggctg gagacaggcg atctgggctt cctctccgag ggagcgctga ccattaccgg 61378 
ccgcgccaag gatctggtca tcatccacgg caccaacttc tcctgctacg agatcgagtc 61438 
cgcggtggag caggtggagg gagtggcgcc ctcctcggcg gcggcggcgg cggtgcggat 61498 
gctcgagggg agcagggagg agctggcggt cttcttcgtg cccacggagg gactcgctcc 61558 
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gcaaccccct gcctcactcc tgtcgcgcat ccgccagcag gtgctcgagc aggtgggggt 61618 

gcggatcgat catctcatcc cgctcgagcc tcaccagctc ccacgcacgg agggaggcaa 61678 

gctgcgccgc tcggagctga gggcgcgctt cgaagcaggt gagctgaggg cgccgcaacc 61738 

cgctccggtc ccgagcccct cgcgcccgct ' ggagcagctc atcgcctccg tctgggcgga 61798 

ggtgctcgag caccaggaca tcgcgcccga ggccagcttc ttcgacctgg gcggcaactc 61858 

catcctcctg gtgcgcgtgg agcgggcact gcgcgcgcgg ctcggcctgg agctgacgct 61918 

catggacctg ttcgcctacc ccacggtgca ctcgctggcg gactacctgg agcctcgagc 61978 

cgcccagctg ccagcacagg cctcccctcc cacgcaggcc gagcgccggc gggggatgcg 62 038 

cggcgcggcg ctggagcaga gacgcactcg ccgccaggca caacgggacg aggactagac 62098 

62144 



acgc atg age cac gac acg cag etc gag gga gec gta gcg gta gtg 
Met Ser His Asp Thr Gin Leu Glu Gly Ala Val Ala Val Val 
8380 8385 8390 



t ? ^ g g ? g ° gC ttC CCC Cag gcc ccc acc ct 9 9 & 9 tac 62189 

Ala Met Ala Gly Arg Phe Pro Gin Ala Pro Thr Leu Glu Asp Tvr 
83 95 8400 8405 

tgg cgc ate ate cac gag ggc gtg gac gcc ttt acc cag etc acg 62234 
Trp Arg He He His Glu Gly Val Asp Ala Phe Thr Gin Leu Thr 
8410 8415 8420 

gac gaa cag etc ctg gcc tec gga gtg ggg ccc teg etc ctg cgc 62279 
Asp Glu Gin Leu Leu Ala Ser Gly Val Gly Pro Ser Leu Leu Arg 
8425 8430 8435 

m« o° C o gt ta ° 9tg Cgg Cg ° gcc gcc ctg ctg 9ac agg atg gac 62324 
o??„ Y yr Val Arg Ala Ala Leu Leu Asp Arg Met Asp 

8440 8445 8450 

^ gSC g f C g f g ttC ttb gg ° ttc tcc CC ST c ^a gag gcg gag 62369 

Gin Phe Asp Ala Ala Phe Phe Gly Phe Ser Pro Arg Glu Ala Glu 
8455 8460 8465 

gtg etc gac cca cag cac egg etc ttc ctg gag tgc gcg cac gag 62414 
Val Leu Asp Pro Gin His Arg Leu Phe Leu Glu Cys Ala His Glu 
8470 8475 8480 

gcg ctg gag egg gcc ggc cat ggc tcc gag cgc gca agg ggg cgc 62459 
Ala Leu Glu Arg Ala Gly His Gly Ser Glu Arg Ala Arg Gly Arg 
8485 8490 8495 

gtg ggg gtc ttc gcc age gcc age etc aac age tac tac ctg cat 62504 
Vai Val phe Ala Ser Ala Ser Leu Asn Ser Tyr Tyr Leu His 

8500 8505 8510 

tcc ctg cac ggg aac gcc egg ctg agg gag gtg ctg gga gac ttc 62549 
Ser p®?,. Hls Asn Ala Arg Leu Arg Glu Val Leu Gly Asp Phe 
8520 8525 

cag czc gcc ate gcc aac gac aag gac ttc ctg ccc acc cgc gtc 62594 
Gxn Lieu_ Ala He Ala Asn Asp Lys Asp Phe Leu Pro Thr Arg Val 
-°- '-' 8535 8540 
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teg tac aag ctg ggc ctg cgc ggg ccc age gtg gcg gtg cag acg 
Ser Tyr Lys Leu Gly Leu Arg Gly Pro Ser Val Ala Val Gin Thr 
8545 8550 8555 



gca tgc 
Ala Cys 
8560 



tec age teg ctg gtg gee gtc cac etc gcg tgc cag age 
Ser Ser Ser Leu Val Ala Val His Leu Ala Cys Gin Ser 
8565 8570 



ctg etc aat ggt gag tgc gec ctg gcg etc gcg ggc ggc tec tct 
Leu Leu Asn Gly Glu Cys Ala Leu Ala Leu Ala Gly Gly Ser Ser 
8575 8580 8585 

etc tec gtg ccc cag gec cag ggc tac etc tac cag gag gga ggc 
Leu Ser Val Pro Gin Ala Gin Gly Tyr Leu Tyr Gin Glu Gly Gly 
8590 8595 8600 

att gee tea ccg gac ggg tac tgc cgc ccc ttc gac gcg gcg gcg 
He Ala Ser Pro Asp Gly Tyr Cys Arg Pro Phe Asp Ala Ala Ala 
8605 8610 8615 

gcg ggc ace aac cga ggc aac ggc gtg ggg gtg gtg ctg etc aag 
Ala Gly Thr Asn Arg Gly Asn Gly Val Gly Val Val Leu Leu Lys 
8620 8625 8630 



ctg etc gag gag gcg ctg gcg gac gga gac ace ate 
Leu Leu Glu Glu Ala Leu Ala Asp Gly Asp Thr He 
8635 8640 8645 



cac gee gtc 
His Ala Val 



ate cgc ggc teg gcg gtg aac aac gac ggg gca cac aag ate ggc 

He Arg Gly Ser Ala Val Asn Asn Asp Gly Ala His Lys He Gly 
8650 8655 8660 

tac ace gcg cca age gtg gag ggg cag gee teg gtc ate tec gag 

Tyr Thr Ala Pro Ser Val Glu Gly Gin Ala Ser Val He Ser Glu 
8665 8670 8675 

gcg ctg gag gtg gcg gec gtc tec gcg gac age ate ggc tac gtg 

ofo. G1U Val Ala Ma Val Ser ^ As P Ser He Gly Tyr Val 
8680 8685 8690 

gag gca cac ggc acc gec acg gec ctg gga gat ccg ate gag gtc 

olL HlS GlY Thr Ala Thr Ala Leu As P Pro He Glu Val 

8695 8700 8705 

t tC *f 3 Cgg gCC ttC CgC aag Cac acg Sag cgc cgc ggc 

oif„ Thr Arg Ala Phe Ar S L ^ s His Th ^ Glu Arg Arg Gly 

8710 8715 8720 



tac tgc 
Tyr Cys 



get etc ggc teg gtc aag gee aac ate gga 
Ala Leu Gly Ser Val Lys Ala Asn lie Gly 
8725 87 30 8735 



cac ctg gat 
His Leu Asp 



ti 9 lit S? C at ° gC ° ag ° ° tg atC aag gcc 3tg ctg gtg ctg 

P^n GlY 116 Ala Ser LeU Ile L * s Ala Val Leu Val Leu 

8740 8745 8750 



gag cgc cgc cag etc ccg cca tgc ccg cac ttc acc 
Glu Arg Arg Gin Leu Pro Pro Cys Pro His Phe Thr 
8755 8760 8765 



tec ccc aac 
Ser Pro Asn 
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srgg cag 

Gly Gin 
8785 



ccg tgg gag ccg gtg gac ggc ccg egg cgc 
Pro Trp Glu Pro Val Asp Gly Pro Arg Arg 
8790 8795 



ffcg ggc gtc 
Ala Gly Val 



age tec ttc ggt att ggt ggc acc aac gtg cat gtg gta etc oao 

2E„ Phe Gly Ile Gly Gly Thr Asn Val His Val v2 Leu £5 

8800 8805 8810 

G?u ??? p C9 « CC ^ 9g CCg gCg CCg Cgg C9g gca gcg egg ccc tac 

3 Pr ° Ala Pr ° Arg ^ g Ala Ala Ar S P« Tyr 
8815 8820 8825 y 

cag etc ctg ccg ctg tea gca cgc acg gag cgc gee egg gag gag 

SXn SU Pr ° LSU SSr Ala Arg Thr Glu Ala Glu Glu 

8830 8835 8840 



gtc gag gee egg ctg cgc gag cac ctg cgc cag cag 
Val Glu Ala Arg Leu Arg Glu His Leu Arg Gin Gin 
8845 8850 8855 



ccc cag gag 
Pro Gin Glu 



ccg ctg gag gag gtg gee cac acg etc cag gta ggc ccc cac car 

8«6n HiS Thr L6U G1 - ?al ^ Arg Arg uts 

8860 8865 8870 

atg gee tgg cgc aca gcg ctg gtg age age age cct gee cag gcg 
Met Ala Trp Arg Thr Ala Leu Val Ser Ser Ser Pro Ala 22 K 
8875 8880 8885 

gta gag ctg ctg gag agg cgc cga ccc gag gcg ctg ctg cgt ggc 

XSn U G1U Arg *** *** Pr ° Glu ^ Leu 2g lly 

8895 8900 

ri^ g ? 9 g f° ° ag gCC Cgc tcc gtg gcc ttc c tt eta ccg ggc 

So, Ala Gln Ala **" Ser Val Ala Phe Leu ^eu Pro Sy 

8905 8910 8gi5 

r?« o° a Sf g taC gtg ggc atg gga CTCCT CTC « etc cat gag tcc 

s£n Gln ^ Val Gly Met Gly Ala Ala Leu "is SlS ser 

8920 8925 8930 

gag gcg ccc ttc cgt gaa cag gtg gat etc tgt gee ggg aaa etc 
Glu Ala Pro Phe Arg Glu Gln Val Asp Leu Cys Ala lly Lys L^u 
8935 8940 8945 

£S lit Si? T CtC ^ ttg ^ gtg Cgp CSC Ctg ctg tac °°t ggc 
G Sn GlY LeU ASP Val Arg Leu Tyr Pro Gly 

8950 8955 8960 



gag acg egg 
Glu Thr Arg 



ccg egg gag egg gaa tgg gee cag gag cgc ctg coo 
Pro Arg Glu Arg Glu Trp Ala Gln Glu Arg Leu Arg 
8965 8970 8g » 5 

gta tgc cag ccg gtg etc ttc acc gtg gag tac gca eta acc caa 

^ Tbo Gln ^ L6U PhS ^r Val Glu Tyr S Leu Sa Sg 

8985 8990 



ctg ctg gag gcc tgg ggg ata cgc ccc acc gcg eta tto aar ran 

Leu Leu Glu Ala Trp Gly lie Arg Pro Thr 111 LeS Hi lly Ss 
8995 9000 9005 

tit III III %T gt ? g ?° gCg tg ° CtC gcg » c Srtg ttc tcc 

Ser Leu Gly Glu Tyr Val Ala Ala Cys Leu Ala Gly Val Phe Ser 
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9015 



9020 



ctg gaa gag gcg ctg gag gtg gtg gcc gca cgt gga 
Leu Glu Glu Ala Leu Glu Val Val Ala Ala Arg Gly 
9025 9030 ■ 9035 



cag etc atg 
Gin Leu Met 



ggc age atg ccc ccg ggg ggc atg etc gcc gtg ggg etc cca gcc 
Gly Ser Met Pro Pro Gly Gly Met Leu Ala Val Gly Leu Pro Ala 
9040 9045 9050 

cag gaa gtg gaa ccg etc eta ccc gcg gca ctg gcc ctg gcc gcg 
Gin Glu Val Glu Pro Leu Leu Pro Ala Ala Leu Ala Leu Ala Ala 
9055 9060 9065 

cac aac age ccc cag gcc tgt gtg gtg gcg ggc ccc ace gag ccc 
His Asn Ser Pro Gin Ala Cys Val Val Ala Gly Pro Thr Glu Pro 
9070 9075 9080 

ctg gcc gag ctg cgc gca cga etc gag cag egg ggc ate gcg tgc 
Leu Ala Glu Leu Arg Ala Arg Leu Glu Gin Arg Gly lie Ala Cys 
9085 9090 9095 

acg cct ctg get gtc tct cac gcc ttc cac teg ccg atg atg gag 
Thr Pro Leu Ala Val Ser His Ala Phe His Ser Pro Met Met Glu 
9100 9105 9110 

ccg gcg gtg gcc cca ttc gtc gcc egg ctg cgc egg atg cat ctg 
Pro Ala Val Ala Pro Phe Val Ala Arg Leu Arg Arg Met His Leu 
9115 9120 9125 

egg cct ccg age ctg ccc ttc ate tec aat gtg act ggc acc tgg 
Arg Pro Pro Ser Leu Pro Phe lie Ser Asn Val Thr Gly Thr Trp 
9130 9135 9140 



ata gag gcc gag gag gcg acc tea ccg gag tac tgg 
He Glu Ala Glu Glu Ala Thr Ser Pro Glu Tyr Trp 
9145 9150 9155 



ggc egg cac 
Gly Arg His 



ctg ctg cag ccg gtg cgc ttt 
Leu Leu Gin Pro Val Arg Phe 
9160 9165 



gcc cag ggg ctg gag egg ttg tgc 
Ala Gin Gly Leu Glu Arg Leu Cys 
9170 



gag ggt gtg cag ccg egg ctg ctg ttg gaa gtg ggg 

Glu Gly Val Gin Pro Arg Leu Leu Leu Glu Val Gly 
9175 9180 9185 

aca ctg ggg egg ctg gcg gcg cgc caa age tec ggc 

Thr Leu Gly Arg Leu Ala Ala Arg Gin Ser Ser Gly 
9190 9195 9200 

gtg gtg tec acg ctg ggc tec age egg gag gaa age 

Val Val Ser Thr Leu Gly Ser Ser Arg Glu Glu Ser 
9205 9210 9215 



ccg ggc cac 
Pro Gly His 



ccc gtg cag 
Pro Val Gin 



teg gag gtg 
Ser Glu Val 



gag egg ctg ctg acg gca ate ggg cga ctg tgg gtg gag gga gcc 
Glu Arg Leu Leu Thr Ala He Gly Arg Leu Trp Val Glu Gly Ala 
9220 9225 9230 



gag gtg gac tgg get ggg etc cac cgc ggc gaa cgc 
Glu Val Asp Trp Ala Gly Leu His Arg Gly Glu Arg 
9235 9240 9245 



egg agg cgc 
Arg Arg Arg 



64079 



64124 



64169 



64214 



64259 



64304 



64349 



64394 



64439 



64484 



64529 



64574 



64619 



64664 



64709 



gcg gtg eta ccc acc tac cct ttt gag cac cag C g c tac tgg gtg 



64754 



WO 2004/022586 



63 



PCT/EP2003/009780 



Ala Val Leu Pro Thr Tyr Pro Phe Glu His Gin Arg Tyr Trp Val 
9250 9255 9260 

n, 9 9 f C CCC Cag CCt tcc cct ccc gag cgg cct 339 aca cct 

Glu Thr Ala Pro Gin Pro Ser Pro Pro Glu Arg Pro Gly Thr Pro 
9265 9270 9275 

??« ni 9 * 3C S Ct gt ? aC ° agC agc ttC tac gtg cca TOt tgg age 
Ala Glu Ser Pro Val Thr Ser Ser Phe Tyr Val Pro Gly Trp Ser 
9280 9285 92 90 

*™ g ? 9 g ? fc Ctg CCC tCg gcg gcc ccc tcg a ^ a c ra fftg ggc egg 

3 ™ Ala LeU Pr ° Ser Ala Ala Pro Ser A ^ Arg Val Gly Arg 
9295 9300 9305 

ctg ctg ttg ttg gcg gag gcc cat ggg tgg agc cag ggg etc gcg 

LSU LSU Ala Glu Ala His Tr P Ser Gin Gly Leu Ala 

9310 9315 9320 

gag egg etc egg gcc egg gga cac tcc gtc ace etc gtc gag cca 

LSU Arg Ala Arg Gly His Ser Val Thr Leu Val Glu Pro 
9325 9330 9335 

rit rf 9 ^ g T ° tC g f g CgC ° tg aCC Ccg gag cac tgg cgc ct 9" ccc 
Y G ^ g LeU Glu Arg Leu Thr Pro Glu His Trp Arg Leu Pro 
9340 9345 9350 

ccg ggc cgc cgc gag gac ttc caa cgc ctg ctg gag gac tcc gga 

«L Arg Arg G1U ASP Phe Gln Arg Leu Le ^ Glu Asp Ser Gly 
9355 9360 9365 

gag ctg ccc teg agg gtg etc cac etc tgg ctg ctg ggg age agc 

o?™ Arg Val LSU H±S Leu Tr P Leu Le » G?J Ser Ser 

9370 9375 938Q 

cag age ccc cag gag cgc ggc ttc tac ace ctg ctg gcg ctg gcc 

XL Pr ° Gln G1U Arg Gly Phe ^ r Thr Leu Leu Ala Leu Ala 
9385 9390 9395 

ri 9 ^ ^ g ?° Ca ° ggC aCC cgg ccc gcg gtg ga ^ ct c acg 

LSU GlY Ala HlS Gly Thr Arg Pro A ^ a ' Val Glu Leu Thr 
9400 9405 9410 

S? fr t9 ^ C gat C f g Ct ° taC gcg gtg gca gag ggc ^ <=cg ata 
Val Val Thr Asp Gln Leu Tyr Ala Val Ala Glu Gly Glu Pro He 
9415 9420 9425 

cag ccg etc aag gcg etc ctg cag ggc cct get tcc gtg ctt ccc 
Gln Pro Leu Lys Ala Leu Leu Gln Gly Pro Ala Ser Val Leu Pro 
9430 9435 9440 

cag gaa etc ccc ggg tgc acg tgc cgc etc gtg gac atg gcc ctg 

U " L " n^L LSU Pr ° Gly Cys Thr Cys ^ Leu Val As P Me t Ala Leu 
9445 9450 9455 



64799 



64844 



64889 



64934 



64979 



65024 



65069 



65114 



65159 



65204 



65249 



65294 



65339 



CCCf CCS 

Pro Pro 



ggc ggc gtg gcg gag gag ctg ctg gag egg ctg gtg gcg 
Gly Gly Val Ala Glu Glu Leu Leu Glu Arg Leu Val Ala 
9465 9470 



tcc acc gcg tcc gaa cgc teg gtg gcg tac cgc acre 
■rxu G±u Ser Thr Ala Ser Glu Arg Ser Val Ala Tyr Arq Glv 

9480 9485 



65384 



65429 
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gcg get cgc tgg gtg cgc gag ttc gtc ccc gtg ccg ctg ccc ccg 
Ala Ala Arg Trp Val Arg Glu Phe Val Pro Val Pro Leu Pro Pro 
9490 9495 9500 



ccc gag cct teg cag etc ccg ctt egg cag aga ggg 
Pro Glu Pro Ser Gin Leu Pro Leu Arg Gin Arg Gly 
9505 9510 9515 



att gtc ggc ggc ctg ggg ggc 
He Val Gly Gly Leu Gly Gly 
9520 9525 



gta ggg etc gcg ctg 
Val Gly Leu Ala Leu 
9530 



ctg gcg egg cgc gtg teg gcg egg ctg gtg etc acg 
Leu Ala Arg Arg Val Ser Ala Arg Leu Val Leu Thr 
9535 9540 9545 



gtg tat ttg 
Val Tyr Leu 



gca gag cac 
Ala Glu His 



ggc cgc teg 
Gly Arg Ser 



ccc acg cca ccg cgc gag age tgg age gca tgg ctg ggc act ccc 
Pro Thr Pro Pro Arg Glu Ser Trp Ser Ala Trp Leu Gly Thr Pro 
9550 9555 9560 



acg egg ctg egg etc tea cag gag etc gag tgg ctg 
Thr Arg Leu Arg Leu Ser Gin Glu Leu Glu Trp Leu 
9565 9570 9575 



cgc ggt gee 
Arg Gly Ala 



gee gag cag ate gag cac cag egg ccc ctg cac tec ctg gee gac 
Ala Glu Gin lie Glu His Gin Arg Pro Leu His Ser Leu Ala Asp 
9580 9585 9590 



agt ccc 
Ser Pro 
9595 



ggc ctg gaa gag teg ctg cgc ctg ctg tgc gee age tac 
Gly Leu Glu Glu Ser Leu Arg Leu Leu Cys Ala Ser Tyr 
9600 9605 



etc tac cac ttt etc ttc cct ctt cag gcg cct ctg aga ctt gga 
Leu Tyr His Phe Leu Phe Pro Leu Gin Ala Pro Leu Arg Leu Gly 
9^10 9615 9620 



gag ccg agg gcg atg cag gca ttg cgt gag cgc ctg 
Glu Pro Arg Ala Met Gin Ala Leu Arg Glu Arg Leu 
9625 9630 9635 



ggg ctg cag 
Gly Leu Gin 



ccc ggc ttc gag cgc etc ttc tec ttc atg ate ggc acc ctg gag 
Pro Gly Phe Glu Arg Leu Phe Ser Phe Met He Gly Thr Leu Glu 
9640 9645 9650 



cag gcg aag ctg att cga gtg gga gag ggc atg etc 
Gin Ala Lys Leu He Arg Val Gly Glu Gly Met Leu 
9655 9660 9665 



gag ggt cag 
Glu Gly Gin 



gtg gag cca gee cac gtg ccc aca cct cga gca ctg cac gag cga 

Val Glu Pro Ala His Val Pro Thr Pro Arg Ala Leu His Glu Arg 
9670 9675 9680 

ctg etc gaa ggc tac ccg gag gtg tea ggc ctg ctg gag ctg ctt 

Leu Leu Glu Gly Tyr Pro Glu Val Ser Gly Leu Leu Glu Leu Leu 
9685 9690 9695 



gag cat tgt gtg cgc cac tac cct gag gtg etc cga 
Glu His Cys Val Arg His Tyr Pro Glu Val Leu Arg 
9700 9705 9710 

ccg gee etc agt gtg etc tac ccc teg ggc cgc tec 
Pro Ala Leu Ser Val Leu Tyr Pro Ser Gly Arg Ser 
9715 9720 ~ 9725 



ggg age ctg 
Gly Ser Leu 



gag gag gac 
Glu Glu Asp 



65474 



65519 



65564 



65609 



65654 



65699 



65744 



65789 



65834 



65879 



65924 



65969 



66014 



66059 



66104 



66149 
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tec gca cga age ggg gtc gag tgg age age atg ggg cag tgc gtg 

££n Arg Ser Gly Val Glu Trp Ser Ser Met G1 y Gin Cys Val 
9730 9735 9740 



gcg ctg 
Ala Leu 
9745 



ctg egg cat ttc etc gee cgc cat gee gag cgc acg cag 
Leu Arg His Phe Leu Ala Arg His Ala Glu Arg Thr Gin 
9750 9755 



ggg cgt ace ctg cgc ate etc 
Gly Arg Thr Leu Arg He Leu 
9760 9765 



gag gtg ggc ggc ggc agt ggc gtg 
Glu Val Gly Gly Gly Ser Gly Val 
9770 



gtg etc cag gee ctg ctg ccg ctg ctg egg cac cac 
Val Leu Gin Ala Leu Leu Pro Leu Leu Arg His His 
9775 9780 9785 



ccg gtg gag 
Pro Val Glu 



tac cac ttc acg gat att ggc ccc tec ttc gtc cgc gec atq aaa 

^ Q7on ^ ASP Ile Gly Pr ° Ser Phe ?al Ala Se? llu 

9790 9795 9800 

??? ° gC ggC Ct ° aCC ttc ctc cac acc tc S 

Y ooL GlY *** Arg Gln Gly Leu Thr Phe Leu His Thr Ser Val 
98 °5 9810 9 815 

ctg gac ate tec cgc cct ccc cca gag cag ggc cat ccg cct ggc 

oo?o 6 Ser Arg Pr ° Pr ° Pro Glu Gln Gl y Hi s Pro Pro Gly 
9820 9 825 9830 

teg tat gat ctg gtg att gee ctc aac gtg gtg cac gee ace ccg 

IoT. ASP LeU Val Ile Ala Leu Asn Val Val His Ala Thr Pro 
9835 9840 9845 

Ha ^ g l° C Ctg gCC CaC CtC gag agc Ctg ctc ^tc cca 

g YSn G n Ser LeU Ala His Leu Glu Ser Leu Leu Val Pro 

9850 9855 98 60 



ggc ggc cac ctg tgc ctg gtg gag acg gtg aag cag 
Gly Gly His Leu Cys Leu Val Glu Thr Val Lys Gln 
9865 9870 9875 



cag ccc tgg 
Gln Pro Trp 



vll SS S r^ C H gt " g gCC gag ggg tg S tW a ^ tac gag 

o««n le TrP GlY LeU Ala Glu G1 ^ Tr P Tr P Ser Tyr Glu 

9880 9885 9890 

r ^ ^ g ° ^ C Cg ° tCt CCC ttg Ctg gag gta ^ c ^ac tgg 

P o L eU Arg Thr Arg Ser Pro Leu Leu Glu Val Gly Asp Trp 

989 5 9900 9905 

Glu St r Ctg ^ C gaC gt ? ggC ttfc gcc gag gtg ^ag fftg ctg 

oof n eU Arg ASP Val Gly Phe Ala Glu Val Glu Val Leu 

"10 9915 9920 



cca gcg 
Pro Ala 
9925 



gcc gtg gag cag cgc tea cgc tgg gac aac gtg ctc ctc 
Ala Val Glu Gln Arg Ser Arg Trp Asp Asn Val Leu Leu 
9930 9935 



r^ " fi ^ Cgg CCC ggC gag agc ggg ctt ca S cca gtg ggc ggg 

alt Gln Arg Pr ° Gly Glu Ser Gly Leu G ln Pro Val Gly Gly 
"40 9945 9950 

cgc gcg gcc atg cag gag egg ate cgc egg ctg egg gcc att gaa 

*rg « A_La Met Gln Glu Arg Ile Arg Arg Leu Arg Ala Ile Glu 



66194 



66239 



66284 



66329 



66374 



66419 



66464 



66509 



66554 



66599 



66644 



66689 



66734 



66779 



66824 



66869 
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9960 



9965 



gaa gcg gga ggc gag gtg ctg 
Glu Ala Gly Gly Glu Val Leu 
9970 9975 



ccc ctg gtg gcg gac gtc acc gat 
Pro Leu Val Ala Asp Val Thr Asp 
9980 



egg gag 
Arg Glu 
9985 

ggg gcg 

Gly Ala 

10000 



cgc atg ggc gag gtg etc gcc gag gtg aag cgc cgc cat 
Arg Met Gly Glu Val Leu Ala Glu Val Lys Arg Arg His 
9990 9995 



etc cat ggg gtg ate 
Leu His Gly Val lie 

10005 



cac gcc gcg ctg gtg 
His Ala Ala Leu Val 

10010 



ctg gag gat 
Leu Glu Asp 



ggg ctg atg cag etc aag acg cgc gag teg gcc ggg cgc gtg ctg 

Gly Leu Met Gin Leu Lys Thr Arg Glu Ser Ala Gly Arg Val Leu 
10015 10020 10025 

gcc tec aag gtc gag ggc acc ctg gtg ctg gac gaa ctg ctg cgc 

Ala Ser Lys Val Glu Gly Thr Leu Val Leu Asp Glu Leu Leu Arg 
10030 10035 10040 

gac gag cca etc gac ttc ttc gtc ctg tgc tec teg ctg age gca 

Asp Glu Pro Leu Asp Phe Phe Val Leu Cys Ser Ser Leu Ser Ala 
10045 10050 10055 



66914 



66959 



67004 



67049 



67094 



67139 



ttg etc ggc gcg ctg ggg cag gcg gac tat gcc gcc gcc agt gcc 67184 
Leu Leu Gly Ala Leu Gly Gin Ala Asp Tyr Ala Ala Ala Ser Ala 
10060 10065 10070 



ttc etc gac gcg tat gcg cac tea cag cga ggg cgc acg gac cgc 
Phe Leu Asp Ala Tyr Ala His Ser Gin Arg Gly Arg Thr Asp Ara 
10075 10080 " 10085 

cgc acc ate tec atg gac tgg gat cgc tgg ctg gag gtg ggc gcc 
Arg Thr lie Ser Met Asp Trp Asp Arg Trp Leu Glu Val Gly Ala 
10090 10095 10100 

gcc atg cgc ctg ggg ctg ggg ctg gcg gcg gga gca ctg ggg ctg 
Ala Met Arg Leu Gly Leu Gly Leu Ala Ala Gly Ala Leu Gly Leu 
10105 10110 10115 

cag cgc acg gca ccc gga gag tac acg gtg cga tgg cag gcc gag 
Gin Arg Thr Ala Pro Gly Glu Tyr Thr Val Arg Trp Gin Ala Glu 
10120 10125 10130 



cgc tgc 
Arg Cys 

10135 



tgg tgg ctg gac gag cac cgc ctg gaa gga egg gcc acg 
Trp Trp Leu Asp Glu His Arg Leu Glu Gly Arg Ala Thr 
10140 10145 



ctg ccg ggg gtg gcg tac ctg gag ctg gtg cga gcc gcc ctg gtc 

LSU fn? E « ° ly Val Ala ^ Leu Glu Leu Val A la Ala Leu Val 

10150 10155 10160 

cag gaa ctg ggc gag gcc ccc gtg gag ctg gag caa ctg gtg ctg 
Gin Glu Leu Gly Glu Ala Pro Val Glu Leu Glu Gin Leu Val Leu 
10165 10170 10175 



etc tct ctg ctg gaa gcg ccc gcg ggg gag gag gtg 
Leu Ser Leu Leu Glu Ala Pro Ala Gly Glu Glu Val 

10180 10185 10190 



gag gtg cga 
Glu Val Arg 



67229 



67274 



67319 



67364 



67409 



67454 



67499 



67544 



ctg cga ccg gag gat gag ggc tat gcg ctg gag ate cgc 67589 
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Leu Arg Pro Glu Asp 



Phe His 
10195 



10200 



67 

Glu Gly Tyr Ala Leu 

10205 



age egg get ggc ggc etc gcg 
Ser Arg Ala Gly Gly Leu Ala 
10210 10215 

atg ggc egg gtg agg atg ctg 
Met Gly Arg Val Arg Met Leu 
10225 10230 

cac ccg eta egg gag ctg gag 
His Pro Leu Arg Glu Leu Glu 
10240 10245 

ccg agg gag cat gag cca get 
Pro Arg Glu His Glu Pro Ala 
10255 10260 

gee ctg ggc egg cgc tgg tec 
Ala Leu Gly Arg Arg Trp Ser 
10270 10275 

aag gga gaa gag ggc ctg gec 
Lys Gly Glu Glu Gly Leu Ala 
10285 10290 

gee gag gat ctg egg caa tgg 
Ala Glu Asp Leu Arg Gin Trp 

10300 10305 



aat ggg ggc tgg cgc 
Asn Gly Gly Trp Arg 

10220 



cct cgc ggg age get 
Pro Arg Gly Ser Ala 

10235 



gaa egg etc ggc ttg 
Glu Arg Leu Gly Leu 

10250 



cag ggc ccc cgc gag 
Gin Gly Pro Arg Glu 

10265 



teg ctg age tgg cac 
Ser Leu Ser Trp His 

10280 



gcg gcg ace ggg ttt gca ccg 
Ala Ala Thr Gly Phe Ala Pro 

10315 10320 

ctg tec tat ggc aac acc cgc 
Leu Ser Tyr Gly Asn Thr Arg 
10330 10335 

ctg tac age cac ate cgc egg 
Leu Tyr Ser His lie Arg Arg 
10345 10350 

gga gtg gtg egg ctg gag gtg 
Gly Val Val Arg Leu Glu Val 
10360 10365 

gag ctg ctg tgc gtg gag gag 
Glu Leu Leu Cys Val Glu Glu 
10375 10380 

gac teg ttg get egg cca cag 
Asp Ser Leu Ala Arg Pro Gin 
10390 10395 

gca gtg ctt cca cga cca ggc 
Ala Val Leu Pro Arg Pro Gly 
10405 ~ 10410 

ccc ctg gag cgc ctt ccg ccg 
Pro Leu Glu Arg Leu Pro Pro 
10420 10425 



ttg ate gag etc ccc 
Leu He Glu Leu Pro 

10295 

ccc ctg cac cct gec 
Pro Leu His Pro Ala 

10310 

ctg ccc ccc ggt gec 
Leu Pro Pro Gly Ala 

10325 

ate cac ggc ccg ctg 
He His Gly Pro Leu 

10340 

ctg gag ccc gec agt 
Leu Glu Pro Ala Ser 

10355 

cgc etc atg gac ggc 
Arg Leu Met Asp Gly 

10370 

ttc gtg etc cgc egg 
Phe Val Leu Arg Arg 

10385 

ccc get ggc aag get 
Pro Ala Gly Lys Ala 

10400 

gcg etc gac teg ctg 
Ala Leu Asp Ser Leu 

10415 

cag gag gga gag gtg 
Gin Glu Gly Glu Val 

10430 
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Glu He Arg 



age cat gcg 67634 
Ser His Ala 



cga ccc acc 67679 
Arg Pro Thr 



acc cag gcg 67724 
Thr Gin Ala 



ctg gca cct 67769 
Leu Ala Pro 



cgc gcg tgg 67 814 
Arg Ala Trp 



gag gag ctg 67859 
Glu Glu Leu 



ctg ctg gac 67904 
Leu Leu Asp 



tgg ctg ccg 67949 
Trp Leu Pro 



ccg egg cag 67994 
Pro Arg Gin 



gee cag gec 68 03 9 
Ala Gin Ala 



gaa ggg egg 68084 
Glu Gly Arg 



gtg gag gtg 68129 
Val Glu Val 



ccg cgc cag 68174 
Pro Arg Gin 



cga ctg caa 68219 
Arg Leu Gin 



gag gtg cag 682 64 
Glu Val Gin 
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Si? ?? C g f g ggC Ct ° aat ttc aag gac gcg ctg ct ^ 3cg ctg 683 09 

Leu Ala Ala Gly Leu Asn Phe Lys Asp Ala Leu Leu Ala Leu 

10435 10440 10445 

ggg gcg ttg cca gtg gag ctg gcc aac ggc gcc cca ctg gcg ctg 68354 
Gly Ala Leu Pro Val Glu Leu Ala Asn Gly Ala Pro Leu Ala LeS 
10450 10455 10460 

lit SS r ag n g ° g ? g g ? C at ° atC tCC gCg gta ggt cca <*tg 68399 

Gly val Glu Cys Ala Gly He He Ser Ala Val Gly Pro Gly Val 



10465 10470 



10475 



2S p?f ? tg Stg 390 gag g ° g gtg gtg gcc gcg gca <? ca ^ 68444 

?i;«n 9 Gly G1U Ma Val Val Ala Ala Ala Ala G?y 

10480 10485 10490 

Ala pS ??» c C9 ?, fc ? Cga gt ° CCC CHg gag Cag gtc ttc ccc 68489 

fnLc SSr HlS Val Val Pro Gln Glu Gin Val Phe Pro 

10495 10500 105 05 

Lvs p™ ll 9 T Ctg ^ ^ C ^ Cag gCg gCg atg gtg ccc fftc 68534 

Y f „ 3 Gly LSU Ser Phe Glu Gln Ala A la Met Val Pro Val 
10510 10515 10520 

act ctg ttc acc gcc tgg tat gcc ctg gag gag ctg gcg egg ctg 68579 
Thr Leu Phe Thr Ala Trp Tyr Ala Leu Glu Glu Leu Ala A?g Leu 



10525 10530 



10535 



cgc gcg gga gag cac gtc etc ate cac gcc gcg gcc acg ggg gtg 68624 
Arg Ala Gly Glu His Val Leu He His Ala All Ala Th? Sly Sal 



10540 10545 



10550 



Sv l2 a?? ^ 2 ? f ag ttg 009 Ctg Cgg Cgg ggC gcc ac 9 ^ 68669 
GlY Ala Gly Val L ^ s L *u Ala Leu Arg Arg Gly Ala Thr Val 

10555 10560 10565 

Tyr lit ihr 111 ri C agC ^ CCg aag ° gC gag Ctg Ctg cgc tcc 68714 
tn*™ Y Ser Glu Pro Lys Arg Glu Leu Leu Arg Ser 

10570 10575 10580 

Leu G?v Si? ™J r Ctg ^ C 5 tSf gaC tCg CgC gCg CCC ggg fctt gac 68759 
G i y «R U Ala MSt ASP Ser Arg Ala Pro G ly ^e Asp 

10585 10590 10595 

gac cag ctt ctg caa cac acc cag ggg cga ggc gtg gac gtg gtg 68804 

P SU LSU Gln HlS Thr Gln GlY Arg G1 y Val Asp Sax Sa? 

10600 10605 10610 

etc aac teg ctg age ggc gag ttc etc tcc cgc age ctg ggc gtg 68849 
Leu Asn Ser Leu Ser Gly Glu Phe Leu Ser Arg Ser Leu Sy Sa? 



10615 " 10620 10625 

gtc gag ctg ggt 
Val Glu Leu Gly 
10630 10635 io 6 40 



etc get cgc cac ggg cgc ttc gtc gag ctg ggt gta acra aac ato rrrqa 
Leu Ala Arg His Gly Arg Phe Val Glu Leu g y SS A?J As P Me? 



L^u Ser Sv SYv JS? 9 ^ ° tg gCt CCC ttC gag cgc ggg ctc 68939 
?nL* Y LSU Pr ° Leu Ala Pro Phe G1 » Arg Gly Leu 

10645 10650 10655 

£ S £ S SI £ S S S S £ SI s £ £ 68384 

10660 10665 10670 
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gag ctg 
Glu Leu 

10675 



atg ggc gag gcc ctg egg cag ate gag cgc gga gag ctg 
Met Gly Glu Ala Leu Arg Gin He Glu Arg Gly Glu Leu 
10680 10685 



gag ccg ctg ccc tac acg gca tgg cca ctg gag cgc gtg gcc gag 

™n Pr ° W Thr Ala Trp Pro Leu Glu ^g Val Ala Glu 

10690 10695 10700 

gcg etc cag etc gtc tec aag ggc agg cac gta ggc aag gtg gtg 

5t?« B Gln LeU Val Ser Lys *** His Val Val Val 

10705 10710 10715 

g f a gag CCC ttg tca cgg CC ^ cca ffcg etc ccg age 

TS.n G1U G1U Pr ° LeU Ser ^ g Pro Pro Ala Pro Ser 

10720 10725 10730 



tec tca agg cca gtg caa ccc egg gca ggc gtg gcg 
Ser Ser Arg Pro Val Gin Pro Arg Ala Gly Val Ala 

10735 10740 10745 



att gtc ggg 
He Val Gly 



ctg cgc 
Leu Arg 

10750 



tec gcc gag ggc tgc gag gcc ttc gag cgc ate etc gcg 
Ser Ala Glu Gly Cys Glu Ala Phe Glu Arg He Leu Ala 
1° 755 10760 



gcc ggc ctg ccg cag gtg gcc gtc tec ace cgt gag ctg cgc gcg 

Val Ser Thr Arg Sg HI 

10765 10770 10775 

cgc atg gcg gag ate gag cgc ctg cgc gtc tec tec tgg gag cca 

9 ?E«n G1U G1U LeU Arg Val Ser Tr? gJS Pro 

10780 1° 7 85 10790 

t " S CC tCg gtg CCC CgC ggc tcc ccc CTCC agg ace gaa cgc 
r ° Val Pr ° ^ g Gly Ser Pro Ala ^ Thr Glu A?g 

10795 10800 10805 

tee aag ccc tat gtt get cct cgc ace gag cgc gag cgc gcc ctg 

fSi 0 ^ Pr ° ^ g Thr G1U ^ g ^ SS LeS 

10810 10815 10820 



gcc gcg etc tgg cag gac ctg etc ggc gtc tcc gag 
Ala Ala Leu Trp Gln Asp Leu Leu Gly Val Ser Glu 

10825 10830 10835 



gtg ggg ttg 
Val Gly Leu 



69029 



69074 



69119 



69164 



69209 



69254 



69299 



69344 



69389 



69434 



69479 



T C " C gag ° tg aaa ggg aac tc ^ ct ^ etc gcc ate 69524 

P ?nLn SP G1U LeU LyS Gly As * Ser ^ La lie 

10840 10845 10850 



cag etc 
Gln Leu 

10855 



atg ggg egg ttg cgc aag gag ctg gag gtg 
Met Gly Arg Leu Arg Lys Glu Leu Glu Val 

10860 10865 



gaa ttg cca 
Glu Leu Pro 



etc tcc tcc ttc etc gcc cgc 
Leu Ser Ser Phe Leu Ala Arg 

10870 10875 



cct acg ctg cgc aca ctg etc tcc 
Pro Thr Leu Arg Thr Leu Leu Ser 
10880 



A ! ^ Met 

i S >\ : 



—a gcc ccg gca acg gac gag cct gcc teg gcc ace ggg 
Pro Ala Pro Ala Thr Asp Glu Pro Ala Ser Ala Thr Gly 
10890 10895 



69569 



69614 



69659 



cca gcc gcc get ccc 
Fro Ala Ala Ala Pro 



cag ccg tca ccg get 
Gln Pro Ser Pro Ala 



gtc tcc age 69704 
Val Ser Ser 
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10905 



10910 



gaa ccc 
Glu Pro 

10915 

ggc gag 
Gly Glu 

10930 



get egg aag tgg cgc cac ctg gtg ccc ate caa cct cag 69749 
Ala Arg Lys Trp Arg His Leu Val Pro He Gin Pro Gin 
10920 10925 



age cct ccc ttc ttc tgg get gcg ccg ctg atg ggc acc 
Ser Pro Pro Phe Phe Trp Ala Ala Pro Leu Met Gly Thr 
10935 10940 



gtg ttt ccc tac ttc acg ctg gec cgc tgc ctg gcg 
Val Phe Pro Tyr Phe Thr Leu Ala Arg Cys Leu Ala 

10945 10950 " 10955 

ccc ttc tat get etc cag ccg cca ggg ttg cag gag 
Pro Phe Tyr Ala Leu Gin Pro Pro Gly Leu Gin Glu 

10960 10965 10970 

ccc atg gac aag gtg gag gaa ctg gcg gee ctg tac 
Pro Met Asp Lys Val Glu Glu Leu Ala Ala Leu Tyr 

1097 5 10980 10985 



ccc acc cac 
Pro Thr His 



ggg cag gag 
Gly Gin Glu 



gtg cgc gag 
Val Arg Glu 



ctg aga gag ctg cag ccg cga ggg ccc tac cgc ctg 
Leu Arg Glu Leu Gin Pro Arg Gly Pro Tyr Arg Leu 

10990 10995 " iiooo 



tec ttc ggc tgc gtg gtg get 
Ser Phe Gly Cys Val Val Ala 

11005 11010 



tat gag gtt gee eta 
Tyr Glu Val Ala Leu 

11015 



gga ggc tgg 
Gly Gly Trp 



cag etc gag 
Gin Leu Glu 



69794 



69839 



69884 



69929 



69974 



70019 



gcg get 
Ala Ala 

11020 



gga gag cag gtg gcg ctg ctg age ctg ctg gac ttc cct 70064 
Gly Glu Gin Val Ala Leu Leu Ser Leu Leu Asp Phe Pro 
110 25 H030 



111 ?ro til ??f ^ ??° CtC tgg gCg aCg gcc c *c ttc 70109 

Ala ?f?„ Ser G1 * Gln Ar ^ Ala Ser Leu Trp Ala Thr Ala Arg Phe 
11035 11 040 H045 



Phe lit ST c g S ? r g CgC ^ Ctg gCg CCC tat acg ctc ^ 
fTn^n Y Val LeU Gly Leu Ala Pro Thr Leu Asp 

11050 H055 H060 



tat ctc tac ctg gcg get cgc get cct ctg aat cca 
Tyr Leu Tyr Leu Ala Ala Arg Ala Pro Leu Asn Pro 

H065 11070 11075 



gac get gee 
Asp Ala Ala 



70154 



70199 



teg cgc ctg age age tec ctg 
Ser Arg Leu Ser Ser Ser Leu 

11080 11085 



acc cga gga tgg gag cgc ctg egg 70244 
Thr Arg Gly Trp Glu Arg Leu Arg 
1109 0 



ftt ni ?? g r 5 g g ° ag ^ Ctg Ctg gac cat S cc gec atg gee 70289 

Y ^n Q , ^ Val Gln G1U LSU Leu As *> His Ala Ala Met ila 

H095 11100 11105 



egg gtg gtg cct gag gac tea egg ctg ctg ctg ctg 
Arg Val Val Pro Glu Asp Ser Arg Leu Leu Leu Leu 

11110 11115 11120 



ggc att gec ccc atg ttg agg 
Gly lie Ala Pro Met Leu Arg 

H125 11130 



ctg acc egg get cac 
Leu Thr Arg Ala His 

11135 



cga gag cct 
Arg Glu Pro 



cag cga gee 
Gln Arg Ala 



atg ctc tec tat egg ccc cga ggt cgc ctc cgc cag cgc ate gtc 



70334 



70379 



70424 
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Met Leu ser Tyr Arg Pro Arg Gly Arg Leu Arg Gin Arg lie Val 
11140 11145 11150 

etc ttc cgc acc tec gag cag gec tgg age ttc gcg cga gac ctg 70469 
Leu Phe Arg Thr Ser Glu Gin Ala Trp Ser Phe Ala Arg Asp Leu 
11155 11160 11165 

Glv r CtC ^ C g ? C tCg gg ° gtg gat gtg cgc gag ^eg 7 0514 

Gly Gly Ala Leu Ser Ala Ser Gly Val Asp Val Arg Glu Ala 

11170 11175 11180 

ccg gga gac cat atg acg ctg ctg egg cca cct cac gtg gag egg 70559 

?}? oc ASP H±S Met Thr Leu Leu ^ g Pr ° His Val Glu A^g 

11185 11190 11195 

??! £? g ^ Ct ° ° gg gCC Ctg CtC agc tcc gc 9 cct ccg 70604 

™„ GlU Gln Leu Ala Leu Leu Ser Ser Ala Pro Leu Pro 
11200 H205 11210 

Thr A?g A?g gttCgggctg c 9 c 3ccagag gctgcctcat gcatgccctt 70656 

11215 

tggcagccag cccgagctgt ccacccactc cagcgcctcg etcagategt gcttttcgac 70716 

gatctgatcc aaggtgacca cctgcgcttc cacgctgttg cccagatgag agggcacccg 70776 

cggatcctct ctccaataat tgcctcgcgg cagcagcctc ttcgcctgga agtcgatggc 70836 

ctccactccc gccgctacgg cgatgtccac caaggcccag ggcaacttcg agttcaacag 70896 

ccacttccac cagccccagt tctcgttgta gcccttcagg tagtgeacca tgtcccccgc 70956 

cccctgggag aacacgagga ggtcccgcag gtccgccgcc ttggcgtaga tcgcctgggc 71016 

cacggctgcc atggtggggt tgttggcgaa caagcctcca tccacatagc cctggtacac 71076 

eggegagacg ateggcaggg agctggagcg cagggecagg tctaccaggc ggtctcctgg 71136 

gttgacgaag tcggcacccg gaaagttgtg aaagactege ggcttccact cccgtctgtt 71196 

ctegtcaggg ttttcatcgt ccaacttgaa ggeggggate accaccttgc gcttcaggtc 71256 

cccgagcgtc ttgtctccca ggatgtgggt gagegcaega gccatgttct tgctgggcgt 71316 

cagcgcgttc atgccaatga tggcccccaa tctggtcctg agegatgagt ggaacagctc 71376 

atgcgtgagc cagagctgcc gcgcctgctc gagccccacg gcaggctcgc gagccgccgc 71436 

gatgatgagc gcgctgatgg ctcccgccga ggtgcccgcc accaggtcga cctgatcaag 71496 

gaagceggge cgcaactggt egatgegett caacaggcca atggacagca gcgtccgcag 71556 

tcctcctccg tccaggctga ggatgcggta cttgccattc cggcgagccc cgtcggctcc 71616 

tggctgccca tggatgatgg tcatttctcc acctccggcg gtccgaaggc gagatgeege 71676 

2.gc*-gr-ggcc accccatgtc aagcaggtgg ccgtcaggcc gttcgaaccc ctgctcagct 7173 6 

ccgcgcctct cccgactcgc cactgaactc acgctgcgcg ccagaggctg tctcctgcgt 71796 

ggccrcgggt tgccagcccg agttctccac ccactccagc accttgctca gategtgect 71856 
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cttgacgacc cgatccaagg tggcgatctg ggcctccaca ctgttgccca ggttgaaggg 71916 
caccaccgga tcctccctcc agtagttgcc ccgcggaagc agcatctttg cctggaagtc 71976 
gatggcctcc actcccgcct cgatggtggc ggccacgaag gccatgggct gcttcgggtc 72036 
cagcagccac tttctccagc cccagttctc gttataaccg tccaggtagt ccaccgagtc 72096 
gcctgtgccc agggagaaca cgaggatgtc ccgcacatcc gccgccttgg cgtagatggc 72156 
ctgggccacg gctgacatgg tggggttgtt ggcgaacaag cccccatcca catagccctg 72216 
atggaccggc gagacgatcg gcagggagct ggagcgcagg gccaggtcca ccaggtagtc 72276 
atccaggttg acgaaagtgt cacccgggaa gttgtggaag attcgcggcc tccacccgcg 7233 6 
cctgtcctca tcagggtcgc catcgtccag ctggaaggcg gggatcacca ccttgcgctt 72396 
caggtcccgg agcgtcttgt ctcccaggat gtgggtgagc gcgcgagcca tgttctcgct 72456 
gggcatcagc gcgctctggc caaccagggc ccccagctga ttggagagtg ggctatcgaa 72516 
cagcccaggc gtgaaccaga tctgccgcgc ctgctcgagc cccacggcag gctcgcgagc 72576 
cgccgcgatg atgagcgcgc tgatggctcc cgccgaggtg cccgccacca ggtcgacctg 72636 
atcaaggaag ccgggccgca actggtcgat gcgcttcaac aggccaatgg acagcagcgt 72696 
ccgcagtcct cctccgtcca ggctgaggat gcggtacttg tcattccggc gagccccgtc 72756 
gcctcctgtt ttccgagtgc ccgaagtcat ttcgtcactc caggcagagg gctctcgggg 72816 
ggcacactgc cctcacgaag gggaggtacc ccacggaagc gcttcgaggg cgaggcaggt 72876 
actggcttgc cggagggatg ggggactcca tgcttgtcca ccggcaccgc gctctgtgca 7293 6 
tgggactccg cgtcttccaa atcctgctcc accacctcca gcatggcgcg cagcaactcc 72996 
ctgcttgccc gagtggcccc gagtcggagc gtctccatgg ccctcaggta cctgcccagc 73056 
cctgccacgg gcatcttctg cgtcactgcc gccgccgtac tgggcgtctg tccttgtatt 73116 
tgtaccgcgt ccctgtagtc cacgtttcgg ccctccccaa cgaaagtgtc atgcacagcc 73176 
gggattcctc actcactgta catggcacct caggtgtctt ccacccgctt gcgctcgcat 73236 
gttcatattc atcgtttgcc ctggagtgtg cgattgacct tgggctccgt gcgtagaagg 73296 
ccgtgaaaat ccaagccgct cctccaaggc attgcgcgtc ctaacgatgg gaggggatcg 73356 
gggaagtgca gtagccagaa aagcaacggg ctgccaggca gctatctaaa tgggcgttgc 73416 
gtgcgtgcgc cccacctgcg ctcacagact tgggagcagc caaccgtcag gtggagaacc 73476 
atctagcgaa gtagatgctc ccacgggcaa tccccctgcc aaggcatgac ggaccaccaa 73536 
tcccatggtt gctgggccac tccaagcgtg gcggcgccgg tgaaagacct tttcgcgcgc 73596 
attgccaaac agttgcgcga aatcccaact ccctcctgac cggcgcgtcc cttcgtgtga 73656 
cagatccggg ccaggatcgg acgaatcatc gtgtgctggc ttctcttcga gacgtgaagt 73716 
tccacgcgac cgggccgccc cggaggtgcc agccttactt tactggtgtg attgtgatac 73776 
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agcctggtaa tgaagtattt cagggaaatc gatttcgcgt gtgccacgtc aatgcgaatg 73836 

ggatcctctc aaatcgtctt ccacccagaa cacgaacccg aagcgttgcg ccggcatgaa 73896 

gaagtccgag ccatcccacg aggcgggatc gaacagcagc cccttccaca cagggaagac 73956 

gcctcccgga aagatacgcg gcacctcgac gctcatggat tgttcgaggt tgccgcaccg 74016 

gcccgtgacg gccaggccgc cgtagccggg aaccggctgc ttctgcttgt cactgagtgg 74076 

agccgcatgc gtcgtccaaa tcactttcgg acgagcgcat caggatcccc atcaagatca 7413 6 

ccggcaaacc gctcgactat tcccacgcgg ccttcagcat ccccgtcgtc cacgcccgtg 74196 

tactcctcgg tgtccgggat gcgcatcgac acgacgaagg tggacggcgc caaggtgttc 74256 

cgaacctggg gctggaccgt ggccttgatc gtggccgagg agatcaagga agccctggag 74316 

cgcgcgggcg tcaccggcgc gaagttcacg gacgtcaccg gccccggtgc agccaatggc 74376 

tcctgactct gccctcccca ccctcgtacc caggcccggt ggttgttcga ggtggattcg 74436 

cgcagcgtcg tcacggatgt gcctgcggac gagaagctgt ccgccttgaa tgccgcgacg 74496 

accgccttct cagatccgtc tgctcgtagc tcacccacgg gcccaccacc tcaagggagc 74556 

gtacgttgcc ctcaacccgg tacaacgact gatggcccct ccagggcccg agccgcgcgg 74616 

gttcggcgca cggctgaggc cccgcggccc accctgccct atcgccgggc gctcttcgcc 74676 

ttcggcgcct tcttcttcgc cgcgggcttc gcacgcttcg cggccgccgg ggccttgcgc 74736 

gccggacgct tcgcggcgcg ggccggcggg cccgcccaca cctcgagcgc ggcgcgcgtc 74796 

tcggccgcgc gcgtcthgtt gcccagcgcc tcctgcagca gcgcgagccc ctccatggcg 74856 

tggcgcacct gcgcgtccgt gagttcctcg tgctggaaga ggttgcgata cgccttggcg 74916 

agccgggcga gcgcgacgat gggctcggtc accacgccgc tgcccgtggc ctgcggcgag 74976 

cggaagtgcc cgagccggta gctgatgctg ccccagtccc agcgctcgct cgtgggacgg 75036 

ccctgcgcca gccggtcgcg ctccagcgtc ccctgccagt cggcgccgaa ctgggcgcgc 75096 

agggctttct cgaggaagac ctggagcgcg tcgcgcgcgc tgcccagggc cgggtcgagc 75156 

atcgcactga cggtgacatt gcgggtgctg gcggtcgttc gacggatgga caccgggtac 75216 

tgctccttcg agagagaggg aggccccagg aatgccctgg cccgaagcgc cgcgcaaaaa 75276 

aacggtgtcc ccctacgcgg gcttccgcac cacgcgctga cggcgggttc cctcactctc 75336 

aactgccggc gagctgttcc cacgggaacc gggtcaactc gctgtccgtc tcattccgat 75396 

aaacttcgcg ttaacccgaa aaaggggctc gattctctat ggggaggtca ctggaagccc 75456 

acgcatcccc aggcgcacag gagtctcccc c atg aag ttc ccc cct get teg 75508 

Met Lys Phe Pro Pro Ala Ser 
11220 

ttg tgc ttc gtg ttg ccg atg etc tec gee act tec gtc cag gcg 75553 
Leu Cys Phe Val Leu Pro Met Leu Ser Ala Thr Ser val Gin Ala 
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11225 



74 
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11230 



11235 



caa gcg ccc gag acc tgg acg 
Gin Ala Pro Glu Thr Trp Thr 
11240 11245 



gag cac tgg ttc gag cac aac cag 
Glu His Trp Phe Glu His Asn Gin 
11250 



acc gtc 
Thr Val 
11255 



tec cgc gtc tac cag 
Ser Arg Val Tyr Gin 

11260 



gac aat gac gtg gec 
Asp Asn Asp Val Ala 

11265 



gtg tac ttc 
Val Tyr Phe 



75598 



75643 



gac teg gee gtg aac egg tec ate acc tgg ccc aac acc tac ctg 7568 8 
Asp Ser Ala Val Asn Arg Ser lie Thr Trp Pro Asn Thr Tyr Leu 
±1270 11275 11280 



ggc gac gtg tgg cga tac acc aag agg acc tac gga 
Gly Asp Val Trp Arg Tyr Thr Lys Arg Thr Tyr Gly 

H285 11290 " 11295 

acg gac teg cag etc tac acc ate ttc cac gca gga 
Thr Asp Ser Gin Leu Tyr Thr lie Phe His Ala Gly 

H300 11305 11310 



cac ttc ggc 
His Phe Gly 



aag tac agt 
Lys Tyr Ser 



gga ggc cac ccc tec acg tac ttc gac gee age cat gac agg cgc 
Gly Gly His Pro Ser Thr Tyr Phe Asp Ala Ser His Asp Arg Arg 
11315 11320 11325 



aac gtg att gat gtg ggt tec 
Asn Val lie Asp Val Gly Ser 

11330 11335 

ggc acc ggc aac gac ctg gac 
Gly Thr Gly Asn Asp Leu Asp 

11345 11350 



agt tec ccc acg gee tgg act gee 
Ser Ser Pro Thr Ala Trp Thr Ala 
11340 

ate gtc acc cat gag gtg gee cac 
lie Val Thr His Glu Val Ala His 
11355 



75733 



75778 



75823 



75868 



75913 



ate gtg gaa etc gee age aag ggc gtg cac gac teg 
He Val Glu Leu Ala Ser Lys Gly Val His Asp Ser 

11360 11365 11370 



cct gee ttc 75958 
Pro Ala Phe 



ggc ctg tgg cgc gac age aag tgg gcg gag ate ttc aac tac gac 76003 
Gly Leu Trp Arg Asp Ser Lys Trp Ala Glu He Phe Asn Tyr Asp 
11375 11380 11385 

gtc tac ctg gcg ttg ggt cgc acc agt gat gee aat cgc tgg tac 76048 
Val Tyr Leu Ala Leu Gly Arg Thr Ser Asp Ala Asn Arg Trp Tyr 
H390 11395 11400 



aac ctg atg gtc aac acg acg gac age ttc ccc cgc 
Asn Leu Met Val Asn Thr Thr Asp Ser Phe Pro Arg 

H405 11410 11415 

cac tgg ttc egg gat tgg ttc tat ccc ate tac aag 
His Trp Phe Arg Asp Trp Phe Tyr Pro He Tyr Lys 

H420 11425 11430 

ggc tec teg gtg etc aac egg tac ttc gtc ctg etc 
Gly Ser Ser Val Leu Asn Arg Tyr Phe Val Leu Leu 

H435 11440 11445 



ctg ccg aag aac ggc age aac 
Leu Pro Lys Asn Gly Ser Asn 

11450 11455 



tac gcg cga age ctg 
Tyr Ala Arg Ser Leu 

11460 



gee aac acc 
Ala Asn Thr 



aac cac ggc 
Asn His Gly 



gcg cag tac 
Ala Gin Tyr 



aac tgg gga 
Asn Trp Gly 



76093 



76138 



76183 



76228 



gag ttc ate cac ttc tgg agt ggt gee gcg ggc gtc aat etc aag 76273 
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Glu Phe He His Phe Trp Ser Gly Ala Ala Gly Val 

114 65 11470 11475 
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Asn Leu Lys 



ace ctg gec acg tec gca ttc gga tgg ccc gee gag tgg gaa gcg 
Tnr Leu Ala Thr Ser Ala Phe Gly Trp Pro Ala Glu Trp Glu Ala 
11480 11485 11490 

cag ttc ace cag get cag cgt gac ttc ccc ttc ace tac acc ccc 
Gin Phe Thr Gin Ala Gin Arg Asp Phe Pro Phe Thr Tyr Thr Pro 
11495 11500 11505 

ccg gg C ccc acg get gtc acc gtg ttc cag gac cag aac tac ggt 

?J?, « Pr ° Thr Ala Val Thr Val Phe Gln As P Gln Asn Tyr Gly 
11510 11515 11520 

ggc tat gga atg gca ctg ccc gtt ggc cgt tac acc ctg tec ccc 
Gly Tyr Gly Met Ala Leu Pro Val Gly Arg Tyr Thr Leu Ser Ala 
11525 11530 11535 

ctg cat gcc tgg ggc gtg cgc aat gac gac ate acc tec ctg aag 

™„ Ala Trp GlY Val Asn As P As P xl * Thr Ser Leu Lys 

11540 11545 11550 

S? ^ C ta ° aag gtC aCC ctg fcat gaa cat 9 ac aac ttc 

A J" a « SSr Gly ^ Lys Val Thr Leu Glu His A sp Asn Phe 

11555 11560 11565 

age ggt gee age etc acc aag acg gcg gac gat gee teg etc gtc 

?J"c, A Ser LSU Thr Lys Thr Ala As P As P Ala Ser Leu Val 

11570 11575 11580 



76318 



76363 



76408 



76453 



76498 



76543 



76588 



gac gac age tgg aac gac aaa atg tec teg etc ate gtg age gcg 76633 

P t?? OB SSr Trp Asn Asp L ^ s Met Ser Ser Leu He Val s!r Sa 
115 85 11590 11595 



teg ggg atg tag gtcagcgccg ctcgggctcc ggggcgtgcc gggcttcctt 
ber Gly Met 
11600 



76685 



cgcgagccct ccccggcgcg ttcgcgccag gggaatggcg ggccgcccat ggattggaga 76745 

76798 



atgggtcgcc cccaacgggc ccaccgcggt acacagtgcg c atg ttc gac gec 

Met Phe Asp Ala 

11605 



act cgc tac ctg gag cgt ate ggt gtc teg gcg ggt acg ccg etc 
Thr Arg Tyr Leu Glu Arg He Gly Val Ser 111 £ y T h? Pr? leu 

11610 11615 11620 

lit !?? r CtC S at ^ gg g ? C CaC Ctc gag gcc artff ccg ttc g ag aac 
Ala Ala Leu His Arg Ala His Leu Glu Ala Val Pro Phe Glu Asn 

11625 11630 11635 

x" C I" tV* S aC T Ctg f ag cgg ccc atc cgg ctc gat w gcg gee 
Leu Asp He His Leu Lys Arg Pro He Arg Leu Asp Glu Ala Ala 

11640 11645 11650 

aag gtC gtC Ctc cag cgg cgc ttc tgc tac 

«v Pr.e Glu Lys Val Val Leu Gln Arg Arg Gly Gly Phe Cys Tyr 

11655 11660 11665 



gaa ore aat ggc ctc 
u*±\i Asn Gly Leu 



ttc gcc cgc ctg ctc 
Phe Ala Arg Leu Leu 



cgg act ctc ggt tac 
Arg Thr Leu Gly Tyr 



76843 



76888 



76933 



76978 



77023 
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11670' 

egg gtg acg ctg ctg teg gcg 
Arg Val Thr Leu Leu Ser Ala 
11685 

age gee tac ggt ccc gac ttc 
Ser Ala Tyr Gly Pro Asp Phe 
11700 

gac gee teg ggc cgc tgg ctg 
Asp Ala Ser Gly Arg Trp Leu 
11715 

ttc ctg gag ccg ctg cgc etc 
Phe Leu Glu Pro Leu Arg Leu 
11730 

gac gga cgc ggc cac egg ctg 
Asp Gly Arg Gly His Arg Leu 
11745 

gtc tgg cgc gag gcg gcg teg 
Val Trp Arg Glu Ala Ala Ser 
11760 

gtc teg etc ate ccg cgc gag 
Val Ser Leu lie Pro Arg Glu 
11775 

cac cac cag cag acc teg ccg 
His His Gin Gin Thr Ser Pro 
11790 

etc tgc acc egg acc acg ccg 
Leu Cys Thr Arg Thr Thr Pro 
11805 

ggc gcg etc gtc gtc acc teg 
Gly Ala Leu Val Val Thr Ser 
11820 



76 

11675 
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11680 



cgc gtg gee teg cgt ccc gat ggc 77068 
Arg Val Ala Ser Arg Pro Asp Gly 

11690 ^ H695 

gac cat ctg gcg etc ctg gtg gag 77113 
Asp His Leu Ala Leu Leu Val Glu 

11705 11710 

gee gac gtg ggc ttc ggc gag tgc 77158 
Ala Asp Val Gly Phe Gly Glu Cys 

H720 H725 

gac gag cgc ggc gtg cag acc cag 77203 
Asp Glu Arg Gly Val Gin Thr Gin 

11735 11740 

gtc gag gac gcg gag ggg etc gtc 77248 
Val Glu Asp Ala Glu Gly Leu Val 

11750 11755 

ggg gac tgg aag gee cag tac gtc 77293 
Gly Asp Trp Lys Ala Gin Tyr Val 

11765 11770 

etc ggg gac ttc gcg gee atg tgc 7733 8 
Leu Gly Asp Phe Ala Ala Met Cys 

11780 11785 

gag tec cac ttc acg cag cga cgc 77383 
Glu Ser His Phe Thr Gin Arg Arg 

11795 11800 

gac ggg cgc ate acc etc aag gag 7742 8 
Asp Gly Arg lie Thr Leu Lys Glu 

H810 11815 



ggg ggc gcg cgc cac gag cag 
Gly Gly Ala Arg His Glu Gin 
11825 



etc ccg gac gag gec 
Leu Pro Asp Glu Ala 

11835 

ggc ate acc ctg ccc 
Gly lie Thr Leu Pro 

v 11850 

aggagcegge ccactctacc 

gcacttcact gagegtagaa 

ggtgatggat caaattctaa 

etteggttte aggagtgece 



gcg tgg egg gee gcg etc acg egg cac 
Ala Trp Arg Ala Ala Leu Thr Arg His 
11840 

tec ggc gga tga cgccgctccg gaegcacatg 
Ser Gly Gly 



egg 
Arg 
11830 

ttc 
Phe 
11845 



77473 



77518 



77565 



tgccgccgtg aaggccccgg gaegggtccg caggccgcca 77625 
atccaaagga ctcgccgttt gccctgctgg cactggggcc 77685 
gaaacttcga gcccgcacct tcgacaatcc agccagacac 77745 



cga atg acg acg tec aag gtc egg tec ggc 
Met Thr Thr Ser Lys Val Arg Ser Gly 
H855 11860 



77795 



gee acg ctg ggt gga 
Ala Thr Leu Gly Gly 
11865 



gtc act gec agg ttc age acc ggg gag cag 77840 
Val Thr Ala Arg Phe Ser Thr Gly Glu Gin 
11870 11875 
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W 77 ^ 

ccg gcc ccc aag aag tac gtg gat tac acc gtg aag age ggt gac 77885 
Pro Ala Pro Lys Lys Tyr Val Asp Tyr Thr Val Lys Ser Gly Asp 
H880 11885 11890 

acg ttc teg ggc ate gcg gcg aag cac gag etc teg ctg gcc gca 77930 
Thr Phe Ser Gly lie Ala Ala Lys His Glu Leu Ser Leu Ala Ala 
H895 11900 11905 

ttg aag aag ctg aac ccg cag gtc gag aac ate aac gtc ate cac 77975 
Leu Lys Lys Leu Asn Pro Gin Val Glu Asn lie Asn Val lie His 
11910 11915 11920 

ccg ggc cag aag ctg cac gtg aag gtc gtg aat gcg ccc gcg cct 78020 
Pro Gly Gin Lys Leu His Val Lys Val Val Asn Ala Pro Ala Pro 
11925 11930 11935 

gag ccg aag ccg aca ccg aag ccg ccg acc aac aaa aac acc age 78065 
Glu Pro Lys Pro Thr Pro Lys Pro Pro Thr Asn Lys Asn Thr Ser 
11940 11945 11950 

gtg etc ccc aag ggc ate ccg aac acc gag ggc atg tct cag gcg 78110 
Val Leu Pro Lys Gly lie Pro Asn Thr Glu Gly Met Ser Gin Ala 
H955 11960 11965 

aag gag tac gcg etc tac teg aag tac gtg gag aag tac ggc gac 78155 
Lys Glu Tyr Ala Leu Tyr Ser Lys Tyr Val Glu Lys Tyr Gly Aso 
H970 11975 11980 

gcc aag gcg aag cag gac etc gcg gcc ggc aag cgc gtc ate gtc 78200 
Ala Lys Ala Lys Gin Asp Leu Ala Ala Gly Lys Arg Val lie Val 
H985 11990 11995 

ggc ctg agg gtg aac acg ccc ttc acc cgt gat cgc ccc tec ggc 78245 
Gly Leu Arg Val Asn Thr Pro Phe Thr Arg Asp Arg Pro Ser Gly 
12000 12005 " 12010 

ggc tec tac gac gat egg etc gtc gtc atg tgg aag gac age age 7829 0 
Gly Ser Tyr Asp Asp Arg Leu Val Val Met Trp Lys Asp Ser Ser 
12015 12020 12025 

ggc aag ccg cac gtc cag gag ttc aag gcc aac acc gag ccc aac 78335 
Gly Lys Pro His Val Gin Glu Phe Lys Ala Asn Thr Glu Pro Asn 
12030 12035 12040 

cgc cgc tgg gC c gat gac ccg age cag age acc aag ccg gtc ggc 78380 
Arg Arg Trp Ala Asp Asp Pro Ser Gin Ser Thr Lys Pro Val Gly 
12045 12050 12055 

cgc etc gtc ggc aac aag acc tac cac tac egg aag age ttc aac 78425 
Arg Leu Val Gly Asn Lys Thr Tyr His Tyr Arg Lys Ser Phe Asn 
12060 12065 12070 

ggc aac ttc ggt gg c aac ate etc gcg ccg gac ctg cgc tac ggg 7847 0 
Gly Asn Phe Gly Gly Asn He Leu Ala Pro Asp Leu Arg Tyr Gly 
12075 12080 12085 

^ CS t?° gtg cgg cgc gac acc aac c Sc gat cac egg ate aac 78515 
Asn Pro Thr Val Arg Arg Asp Thr Asn Arg Asp His Arg He Asn 
12090 12095 12100 

teg aag gac gat gtg ttc age ggc gac tgg gg c ggc cag ggc tac 78560 
ser i--ys Asp Asp Val Phe Ser Gly Asp Trp Gly Gly Gin Gly Tyr 



WO 2004/022586 

12105 

tac ttc cat cgc ggc 
Tyr Phe His Arg Gly 
12120 

cag acg atg gac cag 
Gin Thr Met Asp Gin 
12135 

ggc tct cag age gag 
Gly Ser Gin Ser Glu 
12150 

cgtggcgtag gggacccact 
egtaagaect cgccgtgccc 
cacacgccag gaggtggtcc 
gattgaaggc cgggcggcgc 
eggtgegcta cgccgagcac 
aactgggcgt gtccgggccg 
gccgtccgaa gcccgccgag 
ctgccgcacc tgtcaggtag 



78 
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12110 



12115 



gga acg acc gac acg tac tec gec ggc tgc 78605 
Gly Thr Thr Asp Thr Tyr Ser Ala Gly Cys 
12125 12130 

ggc cgc ttc aac aac ttc tgg gec gcg etc 78650 
Gly Arg Phe Asn Asn Phe Trp Ala Ala Leu 
12140 12145 

ttc age tac gtc ctg gee cag gtc ggc tga 78695 
Phe Ser Tyr Val Leu Ala Gin Val Gly 
12155 12160 

ccatccggaa gacccggaca cccgccgcgc gagcaggagt 7 8755 

cgtcgtaacg ggctccagcc ttcgtgagac ttcccttggc 78815 

atggagaagg agctggagca gttccgtcag gaagegcage 78875 

ggtggctcgt tgccattccc egaggegctg cgcgccttcg 7893 5 

accatggega agggtgggac ggtgacggac gcggcgcagc 78995 

acgctctacg agtggcgcaa ggggcgcccc gcgggacacc 79055 

aagggegegg ctctggtgcc ggtgcacgtc agegagegtc 79115 



ttg gag acg gee aag 
Leu Glu Thr Ala Lys 

12175 



tc atg acg cgc gtc gag cag ctg cga gac 
Met Thr Arg Val Glu Gin Leu Arg Asp 

12165 12170 

cag ggg gcg gcg ctg ctg gag gcg gaa aac 
Gin Gly Ala Ala Leu Leu Glu Ala Glu Asn 

12180 12185 



gec egg ctg cac cag 
Ala Arg Leu His Gin 

12190 

egg etc aag ggc gag 
Arg Leu Lys Gly Glu 

12205 

gag acg cat tgg ctg 
Glu Thr His Trp Leu 

12220 

tac gcg tgg acg gtg 
Tyr Ala Trp Thr Val 

12235 

tac ccc age cgc tec 
Tyr Pro Ser Arg Ser 

12250 

tac gag ggc gtc gtg 
Tyr Glu Gly Val Val 

12265 

gcg acg aag tec age 
Ala Thr Lys Ser Ser 

12280 



cgc ctt gag gcg etc gtc gcg gag aat gee 
Arg Leu Glu Ala Leu Val Ala Glu Asn Ala 

12195 12200 

gac gee cag gcg cac ctg cac cac gcg gat 
Asp Ala Gin Ala His Leu His His Ala Asp 

12210 12215 

ctg ctg gac. aag ggg ccg ggc acc aag tgg 
Leu Leu Asp Lys Gly Pro Gly Thr Lys Trp 

12225 12230 

gee teg ccc gac acc gtc ttc cat cgc ate 
Ala Ser Pro Asp Thr Val Phe His Arg lie 

12240 12245 

ggg gcg acg gcg cgc acc gtg ctg ggc gac 
Gly Ala Thr Ala Arg Thr Val Leu Gly Asp 

12255 1226O 

ctg gtg gat ggg tat gcg gcg tac cag acg 
Leu Val Asp Gly Tyr Ala Ala Tyr Gin Thr 

12270 12275 

gcg gac ggg ccc tgt cca gec acc ctg gec 
Ala Asp Gly Pro Cys Pro Ala Thr Leu Ala 

12285 12290 



79164 



79209 



79254 



79299 



79344 



79389 



79434 



79479 



79524 
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ttc tgc tgg gcc cac 
Phe Cys Trp Ala His 

12295 

ttt gcc cct gcc tgc 
Phe Ala Pro Ala Cys 

12310 

tac gcc gtc gag gcg 
Tyr Ala Val Glu Ala 

12325 

gag gag cgt cag gcc 
Glu Glu Arg Gin Ala 

12340 

cag aag tec gcg ccg 
Gin Lys Ser Ala Pro 

12355 

gcg cag cgc gca ctg 
Ala Gin Arg Ala Leu 

12370 

tac atg etc aac etc 
Tyr Met Leu Asn Leu 

12385 

ccg cag gtg ccg ctg 
Pro Gin Val Pro Leu 

12400 

gac atg gtg ata ggc 
Asp Met Val lie Gly 

12415 

gga gag gcc ccg ggc 
Gly Glu Ala Pro Gly 

12430 



79 

gtg agg aga aaa ttc 
Val Arg Arg Lys Phe 

12300 

aaa gag gtg ctg gac 
Lys Glu Val Leu Asp 

12315 

gac ctg ccg ggc tgg 
Asp Leu Pro Gly Trp 

12330 

gcg etc gcg cac egg 
Ala Leu Ala His Arg 

12345 

ctg acc gag cgc ate 
Leu Thr Glu Arg lie 

12360 

ccc ggc agt gcc ttt 
Pro Gly Ser Ala Phe 

12375 

tgg gcg ggg ctg acg 
Trp Ala Gly Leu Thr 

12390 

gac aat aac cac gtc 
Asp Asn Asn His Val 

12405 

agg aag aac etc aag 
Arg Lys Asn Leu Lys 

12420 

cac tac ctg egg cgc 
His Tyr Leu Arg Arg 

12435 
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79569 



ttc gaa gcc cag cag 
Phe Glu Ala Gin Gin 

12305 

etc ate ggc gag ctg 79614 
Leu lie Gly Glu Leu 

12320 

tac gcg ctg gag ggc 79 659 
Tyr Ala Leu Glu Gly 

12335 

etc gcc gtg cgt cag 79704 
Leu Ala Val Arg Gin 

12350 

cgc gac tgg gcg cac 79749 
Arg Asp Trp Ala His 

12365 

cgc aag gcc etc gag 79794 
Arg Lys Ala Leu Glu 

12380 

gtc ttc etc acc cag 79839 
Val Phe Leu Thr Gin 

12395 

gag cgc cag ctg cgt 79884 
Glu Arg Gin Leu Arg 

12410 

aca gcg egg ctg cga 79929 
Thr Ala Arg Leu Arg 

12425 

cct gcg etc gcc gcc 79974 
Pro Ala Leu Ala Ala 

12440 



ate gag aat ccg ggc 
He Glu Asn Pro Gly 

12445 



acc gtg acg ctg ccg 
Thr Val Thr Leu Pro 

12450 



aag gcc cag age tga 
Lys Ala Gin Ser 



80019 



ccaccgctcc 


cgctgtgcgg cccgggcggg tcagctctta 


gttegggetc 


agegegaagg 


80079 


cggacacccg 


gccatccggg atgagegeca cccggtagcg 


cagcategtg 


gagccggcct 


80139 


gcaccaggta 


cgtgtagacg cggtcatcgc cccgcatctc 


ccgcttcgcc 


aggacgagcg 


80199 


gccccgacgg 


tcccaggctc tgtagcagcg actggtaata 


cggcgcggcc 


teegtgaaga 


80259 


accacccggg 


gaegtacgeg aacteggcag ggttcagcgc 


tccggcccgt 


gcctgctcga 


80319 


gcagcgtcgc 


gagccgggcc gtgacctccg gttcgaggtc 


cggaatgggg 


gecageggag 


80379 


geaeggegag 


cgcggggttg aegatggegg egatgecatt 


ggcgatggcc 


gcggggttcg 


80439 


gctccgcgag 


gttgagcagc acgatgatgg acagggegtc 


tccggtgaag 


eggtagtagg 


80499 


eggtcttgaa 


gccctgccat gcaccggcgt gctggtgcac 


gagctggccg 


ttgcgctcct 


80559 
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cgatgaacca tccgaagcca tacggatagg aagctccgct ggagagcttc acgggactca 80619 
ggatgtcccg ccagctcccg ttcgtcagga tggcgcggtc atccacggcg tcatcccacg 80679 
ccagcatgtc tttcagggag aagtagagcg agccatccgc cgtggtgttg agggaggggg 80739 
agacccactc ctggttcttc accaccccgt cgagcaggcg gtaaccggcc gcgcgattgg 80799 
gaacaatgtc agcctcactg atgcctcgcg ccgtcttcat gcgggtgggc ttgaagacct 80859 
gctcgcccag gacgttgacg taggaggtgc ccgcgacgcg gttcaccagg aggcccagca 80919 
ggacgtagcc ggcgttgctg tagttccatc gcaggccggc ggggaagtcc aggggcagcg 80979 
tatagaggaa ctgcgcgaac tcctcgtcgg tgtagtcctt gcgcaggtcg agcagcccat 81039 
ccgagaggcc ggaggtgtgc gtcagcaggt ggcgaatcgt gatgggggcc caggtcgagg 81099 
gcgcgtcggg gaagtacttc gtgatgctgt cggacagcga gatgcggccc gcctccacct 81159 
ggagcatcac cgccatggcg gtgaactgct tgcccaggga gccggactgg aagagggtgt 81219 
ccggggtgac gggaacctga tgctccaggt tcgagaaacc ataccccttc gccacgatga 81279 
ccctgccgtg gctcacgacg gccaccgcga cgcccggcac attcatccgt ttctgctccg 81339 
cgcgcacgaa ggcatcgatg cgctcacgga gcgattcatg ggctgacgag ggacggcccg 81399 
cgagcagacc cacgaggaac accatcgcca gggaaaagga ctgcaacgaa cctcgcatgg 81459 
ggaactccag aggtcccggg cacggattgc ccacggaccc gcgtagggga acaccgtcct 81519 
tcctccgtcg taaaggcttc ggggcggtga gaactacgat agttgtcgta tgaggggttc 81579 
tcttggactc tcatccgctt ccggtttagc cgccgctccc gagcgaagcc tcacgcttgc 81639 
gtcgttcccg ctctcggcgc tgctggcgtg agcgggccgt cacctcggca atgagggggt 81699 
acggcgagga gttaccagga gtcagccctc tttcccgggc tcgagcgggg cgagcggaag 81759 
ccgcccgagc tcctgcgcgc gcgcgtgcac cgcgcgcacc gccgcgtcca tcgcatcccg 81819 
gacgcccgcg ctctcgaaca ccttcaagcc ggccagcgtc gcgccaccag ggctcgtgac 81879 
ctgcttgcgc aacgccgccg gggactcctg gctgcgatgc agcaactcca ggcttccgcg 81939 
catgctctcc accgtcgcca gggtggcctg atcacgcggc agccccaggg ccacgcccgc 81999 
gtcgatgaac gcctcgatga ggaaatagac gaacgcgggc atggggccgg cgaggcccgt 82059 
cgccatctcc acgccggcct cgtcggcgac ctccacttcg gtgccgagcg cgcccagcag 82119 
tgagcggacg cgcaccagcg ccggctcgct caagcccgcg cttggaatcc agaaggtggc 82179 
cgccttgcgg acccgcgcgg gggtgttggg catcgcgcga acgacgagtg ggtgttcgag 82239 
accctcgcac agcgtctgca ggcgcgcgcc cgcgatgatg ctgacgacaa tcgtgtcctc 82299 
acggagctgc ccgcgcagct gctggaggac aacacccagg tgttgcggct tcacggccag 82359 
cacgaccagt tccgcgtccg ccgccgcgag gttgtccgcg gtcacctcga cgccatactg 82419 
ctcgcggagc acccggcgac gctcctcgtc gacctctccg acgaggagcg tgccgggcgc 82479 
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ggccaggcct cccccggtga ccgctccaat caaggcctcg gccatcaccc caccgccgat 82539 

gaagcccagg gctggcctcc gcgagggcac ggtggcatcc atcgaatcgg ccgtcatcgc 82599 

gaggtacgaa cgggattggg gcggatgagt cccttcacga tgcgacccgt ctcgacatcg 82659 

gccagggcct gattcacctc ttcgaggccg taggtgcggc tgaccatgtg ctcccagccc 82719 

ttgccaccgg cgaccctcgc accgtgccgg tcgaggatgc ggagcatccg gtggaagt^g 82779 

ctgaagtcca cgccccacac acctcgaatc tcgaggtgct ttcgattgat ccctttagtg 82839 

agggttaatt gcggccgcga attcgtaat 82868 

<210> 2 

<211> 481 

<212> PRT 

<213> Angiococcus discif ormis 

<400> 2 

Val Thr Arg Pro Tyr Pro Ser Ser Leu Pro Arg Leu Gly Gly Lys Pro 
1 5 10 15 

Ser Ser Gin Met Ala Arg lie Leu Leu Val Asp Asp Glu Lys lie Ala 
20 25 30 

Arg Thr Leu Tyr Gly Asp Tyr Leu Thr Ala Val Gly His Ala Val Thr 
35 40 45 

Ala Val Gly Ser Leu Gin Asp Ala Arg Glu Ala Leu Ala Gly Asp Arg 
50 55 60 

Phe Asp Ala Val Val Thr Asp Leu He Leu Pro Gly Gly Asp Gly Met 
65 70 75 80 

Glu Val Leu Arg His Val Arg Glu Arg His Pro Gly Val Glu Val Val 
85 90 * 95 

v«x lift liar Gly Leu Glu Lys Val Asp Pro Ala Val Arg Ala He Lys 
100 105 HO 

Ser Gly Ala Ala Glu Tyr Leu Val Lys Pro Val Ala Pro Glu Ala Leu 
~ 120 125 

-1- His Ala Val Arg Arg Ala Leu Thr Thr Arg Asp Leu Leu Gin Glu 

135 140 
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Asn Ala Ser Leu Arg Arg His Val Ala Met Leu Glu Ala Gly Gin Arg 
145 150 155 160 

He Ala Thr Thr Leu Glu Arg Glu Lys Leu Ala Ser Ala Thr Thr Ser 
165 170 175 

Ala Leu Glu Ser Met Ala Cys Ala Ser Ala Val Val Leu Leu Glu Arg 
180 185 190 

Asp Gly Asp Lys Gly Leu Arg Leu Gin Gly Val Arg Gly Leu Pro Pro 
195 200 205 

Arg His Glu Pro Glu Leu Val Ala Thr Leu Val Glu Arg Leu Ser Asn 
210 215 220 

Ala Arg Ala Pro Arg Glu Leu Glu Gly Leu Asp Ala Ala Phe Pro Arg 
225 230 235 240 

Val Leu Ser lie Pro Ala Val Glu Gly Thr Thr Val Leu Gly His Ala 
245 250 255 

Val Leu Phe Phe Asp Gly Ala Glu Pro Glu Trp Thr Gly Glu Thr Ala 
260 265 270 

Gly Tyr Leu Val Arg Asn Trp Ala Leu Ala Leu Arg Asn Leu Gly Arg 
275 280 285 

Phe Ala Ala Val Glu Asp Leu Ala Tyr Val Asp Asp Leu Thr Arg Leu 
290 295 300 

Phe Asn Thr Arg Tyr Leu His Leu Val Leu Asp Arg Glu Val Gin Asp 
305 310 315 320 

Ser Val Gin Thr Gin Arg Pro Phe Ser Leu Leu Phe Leu Asp Leu Asp 
325 330 335 

His Phe Lys Ser lie Asn Asp Thr His Gly His Leu Val Gly Ser Arg 
340 345 350 

Leu Leu Val Glu Thr Ala Arg Val Val Lys Gly Cys Val Arg Asp His 
355 360 365 

Asp Val val Ala Arg Tyr Gly Gly Asp Glu Tyr Val Val Leu Leu Arg 
370 375 380 

Asn Thr Asp ser Gly Gly Ala Leu Lys Val Ala Glu Arg He Arg Arg 
385 390 395 400 
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Thr Met Glu Thr His Leu Phe Leu Ala Arg Glu Gly Leu Ser Leu Lys 
405 410 415 

Leu Ser Thr Cys He Gly Val Ala Ser Phe Pro Glu His Ala Arg Asp 
420 425 430 

Lys Ala Thr Leu Leu Asp Leu Ala Asp Arg Ala Met Tyr Arg Gly Lys 
435 440 445 

Arg Gly Ser Arg Asn Val Val Tyr Met Ala Ala Leu Asp Leu Glu Pro 
450 455 460 

Leu Pro Ala Glu Arg Arg Gin Gly Gly Ser Gly Pro Gly Gly Ser Ser 
465 470 475 480 

Thr 



<210> 3 
<211> 346 
<212> PRT 

<213> Angiococcus disciformis 
<400> 3 

Val Gly Gly Val Glu Ala Arg Leu Asp Ala Val Ala Asp Ala Phe Glu 
15 10 15 

Ala Gly Asp Phe Glu Ala Ala Leu Ala Gly Ala Glu Gly Leu Leu Ala 
20 25 30 

Asp Ala Pro Glu Leu Pro Glu Ala Leu His Phe Arg Ala Ser Ala Leu 
35 40 45 

Val Glu Leu Gly Arg Leu Glu Glu Ala Gly Lys Ala Phe Gly Gin Ala 
50 55 60 

Leu Lys Val Ala Pro Glu Asp Leu Glu He Leu Leu Ser Ala Ala Asp 
65 70 75 80 

Cys Leu Val Cys Arg Ala Gly Glu Asp Arg Glu Ala Val Ala Glu Gly 
85 90 t 95 



ru ."-^ imu. Cys Ala Arg Gly Arg Arg Leu Ala Gin Lys Ala Asp Asp 
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100 105 110 

Val Glu Met Leu Tyr Glu Phe Leu Leu Leu Glu Gly Met Gly Leu Asn 
115 120 125 

Gin Met Gly Glu Cys Ala Thr Ala Leu Val Ser Leu Asp Ala Ala Leu 
130 135 140 

Gly His Met Pro Arg Ser Leu Asp Ala Gin Val Glu Arg Gly He Ala 
145 150 155 160 

Leu Phe Glu Leu Cys Arg Phe Asp Glu Ala Lys Ala Ala Phe Glu Lys 
165 170 175 

Val Leu Lys Asp Ala Pro Asp Asp Pro Trp Ala His His Tyr Leu Gly 
18 0 185 190 

Leu He Ala Glu Arg Arg Gly Asp Glu Lys Glu Ala Lys Arg Arg Phe 
195 200 205 

Asp Lys Ala Arg Ala Leu Val Pro Glu Glu Phe Pro Pro Pro Val Glu 
210 215 220 

Leu Ala Glu Ala Glu Phe Asp Arg Ala Val Glu Asp Ala Val Lys Ser 
225 230 235 240 

Leu Pro Arg His Ala Lys Gin Tyr Leu Asp Asn Val Thr He Ala Val 
245 250 255 

Glu Asp Leu Pro Ser Asp Glu Asp Leu Leu Gly Gin Asp Pro Pro Leu 
260 265 270 

Ser Pro Ser He Leu Gly Val Phe Arg Gly Thr Pro Val Gly Glu Arg 
275 280 285 

Ser Val Met Asn Ala Tyr Glu Leu Pro Ala Ser He Val Leu Tyr Gin 
290 295 300 

Arg Asn Leu Glu Arg Phe Ala Arg Thr Arg Glu Glu Leu He Glu Gin 
305 310 315 320 

He Gly He Thr Val Met His Glu Val Gly His Leu Met Gly Leu Asp 
325 330 335 



Glu Asp Asp Leu Trp Gin Arg Gly Leu Asp 
3 *0 345 
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<210> 4 
<211> 214 
<212> PRT 

<213> Angiococcus disciformis 



<400> 4 

Met Gly He Ser Glu Arg Lys Glu Arg Gin Arg Ala Glu Leu Arg Glu 
1 5 10 15 

Gin lie Leu Gin Val Ala Lys Asp Met Val Thr Arg Glu Gly Phe Gly 
20 25 30 

Ala Leu Ser Met Arg Lys Leu Ala Asp Ala Val Glu Tyr Ala Pro Ala 
35 40 45 

Thr Leu Tyr Leu His Phe Glu Asn Arg Asp Ala He Ala Arg Glu Leu 
50 55 60 

Cys He Arg Gly Phe Gin Asp Leu Leu Glu Ala Phe Glu Pro Ala Ala 



65 70 75 



80 



Ser Val Glu Glu Pro Val Glu Arg Leu Tyr Arg Leu Gly Glu Ala Tyr 
85 90 " 95 

Val Lys Phe Gly Leu Glu Gin Pro Glu Thr Tyr Arg Leu He Phe Met 
100 105 HO 

Glu Asp Pro Lys Leu Ser Thr Ala Leu Phe Arg Asp Ala Pro Glu Asp 
115 120 125 

Gly Ala Gly Pro Arg Ser Phe Gly Val Leu Val Lys Val Phe Glu Asp 
130 135 140 



Leu Lys Ala Ala Gly Arg He Ala Glu Asp Ala Glu Pro Ser Lys Leu 
145 150 155 160 

Ala Glu Val Leu Trp Ala Gly Val His Gly He Val Ala Leu Arg Leu 
165 170 175 

Thr Cys Thr Gly Phe Lys Gly Ser Pro Pro Glu Glu Leu Ala Arg Leu 



180 185 



190 



Leu Val Ser Thr Met Val Asn Gly Leu Pro Gly Leu Lys Pro Val Lys 
195 200 205 
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Ala Ala Arg Lys Thr Arg 
210 

<210> 5 

<211> 378 

<212> PRT 

<213> Angiococcus disciformis 



<400> 5 

Met Thr Lys Trp Ser Gly Lys Trp lie Gly Gly Arg Thr Tyr Thr Ala 
15 10 15 

Thr Asp Gly Ser Thr Arg Trp lie lie Arg Lys Thr Val Ala Gly Val 
20 25 30 

Ala Tyr Asn Val Thr Leu Asp Val Arg Ser Glu Ala Glu Ala Leu Ala 
35 40 45 

Glu Leu Ala Ala Phe Arg Arg Asn Pro Ala Ala Tyr Arg Thr Ala Ser 
50 55 60 ~ 

Gin Glu Arg Gin Asp Glu Ala Gin Gin Ala Gin Glu Asp Ala Ala Gin 
65 70 75 " 80 

Ala Val Phe Leu Asp Glu Asp Arg Ala Arg Arg Phe Leu Gin His Leu 
85 90 95 

Lys Ser Ser Gly Arg Ser Asp Glu Tyr Leu Lys Ser Thr Arg Ser Tvr 
100 105 ~ HO 

Leu Ala Ala Trp Ala Thr Ala Leu Ala Asp Lys Asp Leu Arg Ala Val 
115 120 125 

Thr Gly Pro Ala Leu Lys Lys lie Leu Ala Thr Trp Asp Thr Gly Lys 
130 135 140 

Lys Trp Arg He lie Val Phe Lys Ser Phe Thr Ser Phe Leu Gin Asn 
145 150 155 16? 

Ser Gly Glu Leu Asp Pro Met Val Asn Pro Gly Arg Phe Leu Lys Val 
165 170 1 75 



Pro Pro Ala Arg Arg Asn His Glu Leu Lys Gly Tyr Ser He Glu His 
180 185 190 
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Val Gin Gin Leu Tyr Ala Ala Leu Gly Ser Gin Ala lie Arg Asp Val 
195 200 205 

Leu Cys Leu Gin Ala Lys Thr Gly Met His Gly Thr Glu Val Asp Arg 
210 215 220 

Leu Ala Ser Gly Asp Gly Lys He Ala Val Leu Lys Asp Gin Gly Glu 
225 230 235 240 

He Ala Ala Thr Val Thr Phe Lys His Lys Thr Gly Lys Pro Phe Thr 
245 250 ' 255 

Leu Ser Leu Asp Val Gin Gly Leu Ala Ala Ala Gin Arg Leu Gin Ala 
260 265 270 

Arg Gly Ser Ala Pro Asp Arg Gin Thr He Arg Glu Ala Val Glu Arg 
275 280 285 

Val Cys Ala Arg Thr Ser Leu Glu Phe Val His Phe Gly Ala He Arg 
290 295 300 

His Ser Phe Ala Thr Trp Leu Val Glu Gin Gly Ser He Tyr Asn Pro 
305 310 315 320 

Lys Gly Gly Gly Leu Ser Leu Glu Ala Val Ala Gin Ala Leu Asn His 
325 330 335 

Ser Ser Thr Arg Thr Thr Ala Leu His Tyr Val Ser Ala Thr Val Pro 
340 345 350 

Pro Met Tyr Val Val Pro He Arg Leu Glu His Ser Asn Asp Pro Ser 
355 360 365 

Glu Leu Arg He Ala Arg Ala Pro Ala Gly 
370 375 



<210> 6 



-.^ j-j-.* Jon 



<212> PRT 

<213> Angiococcus discif ormis 



as:-- T-ir .".sp Asp Ser Thr Pro Thr Asp Glu Thr Gin He Glu Ala Pro 
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1 5 10 15 

Thr Pro Glu Asn Thr Ser Asp Leu Lys Gly Ala Arg Pro Arg Leu Arg 
20 25 30 

Phe Thr lie Glu Thr Thr Thr Tyr Glu Thr Leu Arg Thr Leu Glu Arg 
35 40 45 

Arg Gly Asn Gly Glu Val Val Leu Leu Ala Glu Arg His Leu Pro His 
50 55 60 

Gly Leu Ala Gly Leu Val Thr lie Lys Arg Leu Arg Asn Pro Val Thr 
65 70 75 80 

Phe Glu Arg Cys Gin Arg Leu lie Glu Glu Val Gin Leu Ser Phe Arg 
85 90 95 

Leu His His Pro Ala He Ala Gin Val His His Leu Lys He His Ala 
100 105 HO 

Asp Arg Pro His Val He Ala Glu Tyr Val Asp Gly Pro Thr Leu Asp 
115 120 125 

Thr He He Ser Leu Ala Thr Met Arg Glu Lys Pro Leu Ser Ala Pro 
130 135 140 

Phe Ala Leu Tyr He Ala Ala Glu Val Ala Asp Ala Leu His His Ala 
145 150 155 160 

His Thr Leu Arg Asp Ser Glu Asn Arg Pro Leu Gly He He His Arg 
165 170 175 

Asp Val Ala Pro Arg Asn He Arg Val Ala Arg Ser Gly Glu Val Lys 
180 185 190 

Val Thr Asp Phe Gly Ala Thr Tyr Ser Leu Met Val Gly Arg Glu Glu 
195 200 205 

Thr Pro Gly Leu Leu Leu Lys Gly Asp Val Ala Tyr Ala Ser Pro Glu 
210 215 220 

Tyr Leu Asn Arg Lys Pro Met Asp Gly Arg Ser Asp He Phe Ser Leu 
225 230 235 240 



Gly Leu Val Leu Met Glu Met Leu Thr Cys Lys His Leu Phe Asp Val 
245 250 255 
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Glu Asp Glu Lys Ala Pro Asn Ala Thr Val Asp Val Lys Thr Glu Glu 

260 265 270 

Thr Pro Ser Val Pro Leu Thr Gin Met lie Ala Leu Val Asn Arg Tyr 
275 280 285 

Arg Pro Glu Asp Val Glu Asn Ala Met Ala Gly Leu Pro Asp Ala Leu 
290 295 300 

Lys Ala lie Val His Lys Ala Leu Gin Arg Lys Leu Ser Glu Arg Tyr 
305 310 315 320 

Ala Thr Ala Ala Glu Met Arg Asp Ala Leu Arg Ala Ala Leu Ala Ala 
325 330 335 

Glu Ser Gin Pro Phe Gly Arg Lys Glu Ala Ser Glu Glu Leu Ala Arg 
340 345 350 

Met Leu Ser Glu Ala Ser Val Leu Arg Asp Arg Val Glu Leu Asp Glu 
355 360 365 



375 380 



370 


Pro Asn Glu Glu 


385 




<210> 


7 


<211> 


640 


<212> 


PRT 


<213> 


Angiococcus 


<400> 


7 



Val Glu Glu Lys Leu Gly Ala Gly Gly Phe Gly Ala Val Tyr Arg Ala 
1 5 10 15 

Arg Arg Gly Gly Arg Leu Tyr Ala Leu Lys Leu He Pro Leu Trp Gly 
20 25 30 

Leu Ala Glu Trp Ala Glu Arg Glu Val Ala He Leu Leu Arg Leu Lys 
35 40 45 

Kis Ser Asn Leu Val Arg He Arg Gly His Gly Gin Trp Pro Asp Glu 
5 " 55 60 
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90 

Ala Pro Gin Ser Phe Phe lie Val Met Asp Tyr Val Glu Gly Arg Arg 
65 70 75 ~ 80 

Leu Asp Val Trp Ala Arg Glu Glu Asn Pro Ser Ala Arg Glu Val Val 
85 90 95 

Leu Lys Val Arg Gly Val Ala Arg Gly Leu Gly Ala Ala His Arg Ala 
100 105 110 

Lys val Val His Arg Asp Leu Lys Glu Ser Asn Val lie Glu Arg Thr 
115 120 125 

Ser Asp Gly Glu Ala Val Val Val Asp Phe Gly Ala Gly Gly Tyr Glu 
130 135 140 

Ser Ala Pro Ser He Thr Gly Gly Val Leu Pro Pro Gly Thr Pro Glu 
145 150 155 160 

Tyr Arg Ala Pro Glu Ala Trp Arg Phe Gin Gin Glu His Gly Asp Glu 
165 170 175 

Arg Gly His Ser Tyr Gin Pro Gly Pro Ser Asp Asp Leu Tyr Ser Leu 
180 185 190 

Gly Val Val Leu Tyr Trp Leu Leu Thr Gly Arg Gin Pro Phe Leu Pro 
195 200 205 

Asp Glu Ala Ala Gly Val Glu Ala Val Leu Asn Arg Ala Pro Lys Pro 
21° 215 220 

Pro His Val Leu Asn Pro Arg Val Pro Glu Ala Leu Ser Ala Val Cvs 
225 230 235 240 

Met Arg Leu Leu Ala Lys Glu Pro Glu Glu Arg His Pro Asp Ala Asp 
245 250 255 

Ala Leu Cys Ala Glu Leu Glu Ala Leu Leu Ala Gin Ala Asp Glu Ser 
260 265 270 

Trp Asp Val Lys Leu Cys Asp Ala Tyr Gly Pro Asp Thr Ala Thr Thr 
275 280 285 

LSU oo« Ala Val Pr ° His Ala G1 * Glu As P Glu L ^u Val Gin Trp Leu 
290 295 300 

Lys Lys Arg Lys Ala Arg Pro Arg Arg Gly Pro Arg Pro Pro His Gly 
305 310 315 320 
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Glu Gly Ala His Glu Pro Asp Ser Asn Ala Val Ala lie Gin Ala Glu 
325 330 33 5 

Leu Pro Pro Pro Val Pro Ala Arg Pro Gly Pro Met Pro Ser Pro Pro 
340 345 350 

Pro Ala Gin Ala Gly His Phe Lys Ala Leu Arg Val Ala Thr Trp Met 
355 360 365 

Gly Leu Leu Val Val Leu Ser Ala Gly Ala Leu Val lie Ala Arg Thr 
Leu Ala Pro Ser Pro Ala Pro Ala Ala Arg His Ala Leu Glu Pro 



385 390 



His 

395 400 



Pro Ser Asp Ala Val Asp Ser Pro Pro Ala Ser Pro Glu Phe Asp Leu 
405 410 415 

Ala Pro Trp Ala Pro Gly Gin Glu Val Ala Ala Pro Trp Met Ala Pro 
420 425 430 

Glu Ala Gin Glu Ala Ala Ala Ala Leu Asp Ser Ala Pro Thr Leu Ala 
435 440 445 

Ala Val Ala Leu Pro Ala Thr He Ser Glu Glu Lys Ala Ser Val Lys 
43U 455 460 

Met Lys Lys Asn Thr Gly Val Phe Ser Glu Pro Glu Pro Gin Arg Phe 
' tjJ 470 475 480 

Arg Gly Gly Thr Val Gly Lys Ala Leu Gly Leu Gly Val Ala Gly Cys 
48 5 490 495 

Leu Ala Met Ala Cys Pro Gly Pro Gin Val Arg Pro Thr Pro Pro Pro 
500 505 510 

Glu Ala Cys Pro Pro Gly Ala Val Glu Ala' Met Ser Arg Leu Arg He 
515 520 525 

Glu Tyr Asp Glu Asp Val Val Ala Thr Phe Phe Ala Thr Ala Glu Glu 
530 535 540 



Gly val Ser Arg Asn lie Thr Val Arg Ala Gly Pro Thr Thr Val His 

560 



545 550 555 



Gly Arg Pro Leu Gly A sn Leu Pro Ala Arg Thr Thr Leu Ser Gly 
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565 570 575 

Arg Leu He He Gly Glu Asp Arg Val Tyr Gly Arg Leu Thr Glu Ala 
580 585 590 

Arg Thr Pro Lys Gly Asp Arg Phe Pro Val Cys Met Gin He He Glu 
595 600 605 

Asn Asp Ser He Gly Phe Lys Lys Glu He Asn Asp Gly Ser Asn Ser 
610 615 620 

Ala Ser Val Tyr Pro Tyr Phe Asp Val Lys Ala Val Arg Arg Phe Glu 
625 630 635 640 

<210> 8 

<211> 306 

<212> PRT 

<213> Angiococcus disciformis 



<400> 8 

Met Ser Pro He Val Val Leu Leu Val He Ala Leu Phe Ser Gly Ala 
1 5 10 15 

Ala Ala Ala Gin Ser Arg Pro Ala Ala Thr Gly Leu Gly Val Arg Arg 
20 25 30 

He Glu Phe Ser Ala Glu Asn Phe Glu Ala Met Thr Thr Pro Glu Val 
35 40 45 

Gin He Ser Pro Gly Val Ser Thr Thr Phe Glu Phe Asn Ser Ala Leu 
50 55 60 

Leu Gin Glu Lys Val Ala Val Glu Gly Ala Asp Arg Phe Ser Leu Val 
65 70 75 " 80 

Asp He Gly Arg Ser Thr Leu Arg Leu Val Pro Ser Glu Gin Val Leu 
85 90 95 

Pro Gly Glu Arg Leu Arg Val Thr Val Arg Phe Leu Asp Gly Ala Ala 
100 105 110 



Pro Val Gly Met Ala Phe He Leu Val Ala His Pro Ala Gin Ala Glu 
115 120 125 



WO 2004/022586 AM PCT/EP2003/009780 
W 93 ^ 

Arg Leu Val Glu Val Trp Arg Asn Gin Arg Thr Val Glu Ser Tyr Gin 
130 135 140 

Gin Glu Ser Lys Glu Ala Arg Ala Glu Ala Gin Arg Cys His Glu Glu 



145 150 155 



160 



Asn Glu Arg Leu Arg Ala Glu Tyr Ala Gly Pro Gly Gly Leu Ala Gly 
165 170 175 

Leu Leu Ala Asn Arg Val lie Gly Lys Ser Gly Val Ser Val Lys Pro 
180 185 190 

Leu Asp Phe Glu Lys Glu Val Arg Gin Arg Pro Gly Asp Ala lie Arg 
195 200 205 

lie Leu Arg Ala Trp Ser Tyr Arg Ser Ala Asn Arg Val Ser Val Leu 
210 215 220 

Met Asp Leu Asp Tyr Pro Glu Ala Ala Gin Arg Trp lie Ala Gin Gly 
225 230 235 240 

Ala Ser Leu Val Ser Lys Thr Gly Glu Thr Leu Ser Met Leu Pro Val 
245 250 255 

Trp Gin Glu Ala Pro Val Ala Asn Ala Arg lie Arg Arg Leu Val Val 
260 265 270 

Glu Ala Glu Ala Glu Pro Asp Ala Ala Gin Asp Thr Phe Thr Leu Lys 
275 280 285 

Leu Trp Glu Ser Asn Gly Leu Arg Thr lie Thr Leu Ser Gly Val Thr 
290 295 300 

Phe Pro 
305 



<210> 9 
<211> 180 

<213> Angiococcus disciformis 

-,|.:. V > Q 

Met Tar Leu Tyr Glu Leu Phe Glu Ala Asp Asp Asp Arg Asp Val Pro 
5 10 15 
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Ala Asp Phe Pro Trp Leu Leu lie Glu He Pro Glu Glu Gin Leu Arg 
20 25 30 

His Ala Ala Leu Leu Leu Asn Glu Gly Glu Trp Arg Pro Ser His He 
35 40 45 

Ser Gly He Trp Tyr Arg Val Asp Pro Glu Arg Pro Ala Gin Lys Gin 
50 55 60 

Gin Arg His Val His Val Ala Ala Lys Lys His He Lys He Pro Thr 
65 70 75 80 

Lys Gin Ala Ser Trp Asn Arg Asp Thr Thr Arg His Asp Arg Lys Thr 
85 90 95 

Phe Asn Ala Lys Leu Gly Ser Gin Gly Ser Tyr Gin Asp Val Ala Lys 
100 105 110 

Thr Ala Leu Gly Leu Pro Pro Glu Ala Val Leu Glu His Val Val Glu 
115 120 125 

Asn Pro Ala Glu Gin Gin Met Leu Leu Thr Glu Ser Thr Asp Pro Gin 
I 30 135 140 

Leu Lys Thr Glu Phe Tyr Arg Trp Leu Glu Ala Gin Pro Arg Arg Ser 
145 I 50 155 160 

Asn Phe Glu Ala Leu Val Glu Asp Ala Val Arg Leu Asn Leu Pro Leu 
165 170 175 

Asp Leu Glu Ser 
180 

<210> 10 
<211> 390 
<212> PRT 

<213> Angiococcus disciformis 



<400> 10 

Val Ala Leu Gin Arg Arg His Pro Leu Leu Arg Ala Lys He Val Ala 
15 10 15 

Gly Thr Pro Pro Arg Phe Thr Ser Gin Gly Val Pro Pro He Ala Leu 
20 25 30 
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Glu Val Val Glu Arg Lys Ala Glu Glu Ser Trp Gin Gin His Val Glu 
35 40 45 

Ala Glu Leu Asp Arg Pro Phe Asn Ser Asp Pro Gly Pro Leu Val Arg 
50 55 60 

Phe lie Leu Val Arg Gly Glu His Arg Ser Glu Leu Leu Cys Ser Tyr 
65 70 75 80 

Asp His Leu He Gly Asp Ala His Ser Gly He Phe Ala Leu Arg Asp 
85 90 95 

Leu Leu Gin Val Met Ala Ser Gin Asp Gin His Leu Pro Glu Leu Ala 
100 105 110 

Pro Arg Pro Ala Tyr Glu Glu Leu He Gly Pro Met Val Pro Gly Thr 
115 120 125 

Arg Leu Leu Gly Ala Ala Val Arg Arg Gly Ser Ser Ala Leu Leu Arg 
130 135 14Q 

Leu Ser Pro Thr Leu Asn Thr Leu Thr Glu Arg Leu Val Ala Pro Arg 
145 150 155 160 

Gly Met Ala Arg Ser Ser Ala Glu Gly Gin Ala Leu Tyr Ser His Arg 
165 170 175 

He Leu Glu Pro Glu Gin Leu Ala Arg Leu Leu Ala Arg Cys Arg Glu 
180 185 190 

Gin Gly Ser Ser Val His Ala Ala Leu Gly Ala Ala Leu Leu He Ala 
I 95 200 205 

Arg Ala Glu Ser Gin Gly Ala Lys Lys Arg Val Thr Leu Thr Leu Thr 
210 215 220 

Ser Ala Leu Asp Ala Arg Glu Arg Phe Gly Val Gly Glu Asp Phe Gly 
225 230 235 " 240 

Leu Phe Thr Thr Gly Lys Thr Asp Phe Phe Arg Ala Arg Gly Ser Thr 
245 250 " 255 

Pro Phe Trp Glu Leu Ala His Arg Leu Arg Ser Pro Leu Gin Ala Ala 
260 265 270 



Arg Lys Lys Arg Ser His Leu Arg Leu Phe Arg Leu He Leu Glu He 
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275 



96 

280 



PCT/EP2003/009780 

285 



Ser Ala Leu Thr Val Asp He Ser Ser Gin Pro Trp Met Glu Arg Ala 
290 295 300 

Thr Arg Leu Gly Leu His Ser Met Leu Ala Leu Ser Asn Val Gly Arg 
305 310 315 320 

Val Glu He Ala Ala Arg Tyr Gly Asn Leu Thr Leu Glu Gly Leu Gly 
325 330 335 

Phe Thr Gly Thr Thr Ala Ala Gin Phe Asp Val Val Leu Thr Ala He 
340 345 350 

Thr Phe Ala Gin Arg Leu Glu Ala Ser Phe Leu Phe Asn Thr Ala His 
355 360 365 

Met Gin Arg Glu Gin Val Glu Gin Leu Ala Ser Arg Thr Trp Glu Leu 
370 375 380 

Leu Ala Glu Ala Thr Arg 
385 390 



<210> 11 

<211> 1541 

<212> PRT 

<213> Angiococcus disci formis 

. <400> 11 

Met Thr Leu Asp Ser Arg Tyr Ala Asn Arg Gly Gly Gly Leu Tyr Glu 
1 5 10 15 

He Asp Gly Leu Ala He Gly His Ser Pro Pro Pro Cys Ser He Pro 
20 25 30 

Gly Gly Val Gin Leu Thr Ala Thr Ala Thr Ala Glu His Ala Leu Lvs 
35 40 45 

Ala Leu Arg Ala Lys Gly Leu Ala Thr Ser Asp Glu Ala Leu Pro Ala 
50 55 60 

Val Arg His Ala Ser Gly Glu Lys Ala Pro Leu Ser Ser Ser Gin Ara 
65 70 75 80 
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Arg Met Trp Phe Leu Glu Gin Leu Glu Pro Gly Asn Ala Ala Gin His 
85 90 95 

Leu Leu Gin Ser His His lie Gin Gly Pro Leu Gin Val Glu Gly Leu 
100 105 110 

Arg Arg Ala Leu Gly Leu Met Val Lys Arg His Glu Ala Leu Arg Leu 
115 120 125 

Val yal Ala lie Gly Ala Ala Gly Pro Glu Gin Arg Val Arg Pro Glu 
1JU 135 140 

Trp Trp Pro Glu Leu Pro Leu Ser Asp Leu Ser Gly Val Ala Gin Ala 



150 i 5 5 



160 



Asp Arg Ala Arg Ala Leu Ala Glu Leu Ala Arg Val Glu Ala Ala Ala 
165 170 175 

Pro Phe Asn Leu Gin Gin Gly Pro Leu Phe Arg Val Arg Leu Ala Arg 
180 185 19Q 

Leu Ala Glu Thr Glu His Val Leu Leu Val Thr Leu His His Leu lie 
195 200 205 

Ser Asp Gly Ala Trp Ser Cys Glu Val Leu lie Lys Glu Leu Ala Met 
210 215 220 

Leu Tyr Gly Gin His Val Glu Gly Ser Ser Pro Glu Leu Ala Pro Leu 

Pro Val Gin Tyr Arg Asp Tyr Ala Gly Trp Glu Ala Ser Phe Ala Pro 
245 250 255 

Ala Gly Glu Ala Leu Glu Ala Trp Trp Arg Gin Arg Leu Ala Gly Val 
260 265 270 

Pro Thr yal Leu Glu Leu Pro Thr Glu Gly Ala Arg Pro Leu Arg Gin 
275 280 285 

Thr Tyr Arg Ala Gly Arg Val Ala lie Thr Val Gin Pro Arg Leu Ara 
290 295 300 

Gin Ala Leu Glu Asp Leu Ala His Lys Glu Gly Val Ser Leu Phe Ala 
305 3 1° 315 320 

Leu Leu Leu Thr Ala Phe Thr Thr Leu Leu His Arg Tyr Ser Arg Gin 
325 330 335 
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Glu Glu Leu Val Val Gly Trp Pro Ala Pro Gin Arg Pro Arg Pro Glu 
340 345 350 

Leu His Gly Leu He Gly Tyr Phe Gly Thr Pro Val Ala Leu Arg Ser 
355 360 365 

Arg Leu Glu Pro His Thr Arg Val Arg Glu Ala Leu Arg Gin Leu Asp 
370 375 380 

Gin Glu Val Arg Glu Ala Asn Ala His Ala Ala Leu Pro Phe Glu Arg 
385 390 395 400 

Leu Val Ser Leu Leu Asp He Ala Arg Ser Pro Ser Arg His Pro Leu 
405 410 415 

Phe Gin Val Leu Phe Asp Leu Leu Pro Glu Gin Pro Gin Pro Thr Ala 
420 425 430 

Gly Gly Cys Ala Phe Arg Pro Trp Glu Ser Phe Thr Gly Leu Val Ala 
4 35 440 445 

Tyr Asp Leu Thr Leu Leu Leu Glu Pro Arg Gly Glu Gly Leu Glu Gly 
450 455 460 

Ala Leu Asp Tyr Ser Ala Asp Leu Phe Ser Glu Ala Arg Met Gin Arg 
455 47 ° 475 485 

Ala Ala Thr Gin Tyr Leu His Leu Leu Glu Gin Leu Val Glu Arg Pro 
485 490 495 

His Glu Arg Leu Ser Arg Leu Ala Leu Leu Thr Pro Gly Glu His Glu 
500 505 510 

Ala Leu Leu Ala Gly His Gly Leu Gly Gly Pro Val Glu Gly Ala Gin 
515 520 525 

Val Leu Ala His His Arg Phe Glu His Gin Val Arg Leu Thr Pro His 
530 535 540 

His Pro Ala Leu Cys Phe Gly Pro Gin Val Leu Ser Tyr Glu Gin Leu 
34:3 550 555 560 



S Ala Asn Pro Leu Ala His Arg Leu Arg Arg Leu Gly Ala 
565 570 575 

Pro Asp Thr Leu Val Gly Leu Cys Val Glu Arg Ser Leu Glu Leu 
580 585 590 



WO 2004/022586 |B PCT/EP2003/009780 
W 99 W 

Pro Val Ala Leu Leu Ala He Trp Lys Ala Gly Ala Gly Phe Leu Pro 
595 600 605 

Leu Asp Val Asn Gin Pro Arg Glu Arg Leu Ala 'Phe Leu Leu Gly Asp 
610 615 620 

Ala Ser Cys Arg He Leu Leu Thr Gin Glu His Leu Leu Gin Arg Leu 
625 630 635 640 

Pro Pro Thr Asn Ala Ala Leu Leu Cys Leu Glu Arg Glu Ala Glu Ala 
645 650 655 

Leu Glu Arg Glu Pro Gin Glu Asp Ala Pro His Glu Ala Gly Leu Asp 
660 665 670 

Asn Leu Ala Tyr Val He His Thr Ser Gly Ser Thr Gly Thr Pro Lys 
675 680 685 

Gly lie Ala Met Val His Arg Cys Leu Ala Asn Leu Val Ala Trp Gin 
690 695 700 

Leu Thr His Glu Arg Leu Gly Gly Pro Ser Arg Thr Leu Gin Phe Ala 
705 71 0 715 720 

Ser Leu Asn Phe Asp He Cys Tyr Gin Glu Leu Phe Thr Thr Trp Ala 
725 730 735 

Ala Gly Gly Thr Val Val Met Val Thr Glu Glu Val Arg Arg Asp Pro 
740 745 750 

Ala Arg Leu Leu Glu Val Leu Glu Gin Glu Gin Val Ser Arg Leu Tyr 
755 760 765 

LeU PhS 116 Ala LeU Gln Gln Leu Ala Ar ^ val Ala Asp Glu Arg 

770 775 780 

Gly Ala Ala Pro Arg His Leu Arg Gln Leu He Thr Ala Gly Glu Gln 
785 790 795 " 



800 



Leu Gln Ala Thr Pro Glu Leu Gln Arg Leu Leu Ser Arg Met Pro Glu 
805 810 ~ 815 

Cys Thr Leu His Asn Gln Tyr Gly Pro Ser Glu Cys His Val Val Thr 
820 825 830 



Ser His Asp Leu Thr Arg Glu Pro Ser Arg Trp Pro Arg Leu Pro 
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100 

83 5 840 845 

Val Gly Arg Pro Leu Ala His Leu Arg Val Leu Leu Leu Asp Gly Glu 
850 855 860 

Gin Gin Leu Val Pro Pro Gly Val Ala Gly Glu Val Phe Leu Gly Gly 
865 870 875 88 b 

Pro Ala Leu Ala Arg Gly Tyr Leu Gly Arg Pro Glu Gin Thr Ala Asp 
885 890 895 

Arg Phe Val Pro Asp Pro Phe Ser Arg Glu Pro Gly Ala Arg Leu Tyr 
900 905 910 

Arg Thr Gly Asp Leu Ala Arg Leu Arg Glu Asp Gly Ala Leu Glu Phe 
915 920 925 

Leu Gin Arg Met Asp Ala Gin Val Lys He Arg Gly Tyr Arg He Glu 
93 0 935 940 

Pro Gly Glu He Glu Val Val Leu Cys Glu His Pro Ala Val His Gin 
945 950 955 960 

Ala His Val Arg Pro Tyr Val Asp Ser Ala Gly Glu Arg Arg Leu Val 
9 65 970 ~ 975 

Ala Tyr Val Ala Ala Arg Leu Glu Asp Thr Asp Gly Ala Glu Thr Glu 
98 0 985 990 

His Val Glu Arg Trp Arg Ala Val Trp Asp Glu Thr Tyr Gly Gly Pro 
995 1000 10 05 

?Sfn Ala Phe ASP Leu Ala Gly Trp Asn As P Ser Val Arg 

1010 1015 1020 

Gly Pro Leu Pro Pro Glu Gin Met Arg Glu Trp Val Glu Thr 

1025 1030 1035 

Y^o LSU MSt G1U L6U Val Pro A ^ Val Leu Glu 

1040 1045 1050 

Leu Gly Cys Gly Ser Gly Leu Leu Leu Arg Arg Leu Ala Pro Arg 
1055 1 060 1Q65 

° yS SSr ^ Trp Gly Thr Glu Leu Ser p ro Val Ala val Glu 

10 ,J 1075 1080 
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Arg Leu Arg Glu Gin Leu Gin Thr Gly Gly Ser Pro Leu Ala Gin 
1085 1090 1095 

Arg Val Arg Leu Met Ala Gin Pro Ala Asp Asp Phe Ser Gly Leu 
1100 1105 1110 

Pro Glu Ala Gly Phe Asp Thr Val lie Leu Asn Ser Val Thr Gin 
1115 1120 1125 

Leu Phe Pro Ser Val Asp Tyr Leu Leu Arg Val Val Glu Gly Ala 
1130 1135 1140 

Leu Arg Val Leu Gin Pro Gly Gly Thr Leu Phe lie Gly* Asp Val 
1145 1150 1155 

Gin Asn Leu Arg Leu Phe Glu Leu Phe His Ala Ser Val Ala Leu 
1160 1165 1170 

Glu Gin Ala Ser Ala Asp Leu Glu Ala Pro Ala Leu Leu Ala Arg 
1175 1180 1185 

Thr Arg Gin Arg Met Leu Leu Asp Glu Arg Leu Tyr Val Asp Pro 
1190 1195 1200 

Asp Phe Phe Ala Ala Leu Ala Thr His Phe Pro Gin Leu Gly Ala 
1205 1210 1215 

Val Arg Leu His Leu Lys Arg Gly Ser Gly Arg Asn Glu Met Asn 
1220 1225 1230 

Arg Phe Arg Tyr Asp Val Glu Leu Gin Leu Ala Pro Val Ala Lys 
1235 1240 1245 

Ala Gly Pro Ala Gin Glu Leu Pro Glu Leu Asp Trp Arg His Glu 
1250 1255 1260 

Gly Leu Gly Leu Glu Arg Leu Glu Arg Met Leu Ala Glu Arg Pro 
1265 1270 1275 

Ala Gly Leu Val Leu Arg Asn Val Ala Asn Ala Arg Thr Ala Asp 
1280 1285 1290 

Glu Ala Ala Arg Leu Ala Leu Leu Arg Thr Gly Asn Ser Val Gly 
1295 1300 ~ 1305 



Arg Leu Arg Ala Leu Pro Ala Val Thr Ser Trp Asn Pro Glu Gin 
1310 1315 1320 
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Leu Trp Arg Leu Ala Glu Ala Ala Asp Tyr Thr Cys His Val Thr 
1325 1330 1335 

Trp Ser Ala Gin Asp Glu Glu Gly Arg Phe Asp Ala Leu Leu Met 
1340 1345 " 1350 

Ala Arg Ala Ala Gly Ser Ser Arg Pro Ala Ala Trp Leu Thr Pro 
1355 1360 1365 

Pro Pro Pro Pro Pro Arg Pro Trp Lys Ser Tyr Ala Asn Gin Pro 
1370 1375 1380 

Leu Ala Ala Ser Arg Arg Arg Thr Leu Val Gly Val Leu Arg Ser 
1385 1390 1395 

His Leu Glu His Lys Leu Pro Glu Tyr Met Val Pro Ser Ser Phe 
1400 1405 1410 

Val Leu Leu Asp Ala Leu Pro Leu Lys Pro Thr Gly Lys Leu Glu 
1415 1420. 1425 

Val Ala Ala Leu Pro Pro Pro Glu Pro Ala Gin Ala Glu Gin Thr 
1430 1435 1440 

Pro Gly His Leu Ala Pro Arg Thr Pro Thr Glu Arg Arg Leu Ala 
1445 1450 1455 

Glu Leu Trp Arg Arg Val Leu Gly Val His Arg Val Gly Val Glu 
1460 1465 1470 

Asp Asn Phe Phe Gin Leu Gly Gly His Ser Leu Leu Ala Thr Arg 
1475 1480 1485 

Leu Leu Ser Leu lie Arg Gly Glu Leu Gly Leu Glu Leu Pro Leu 
1490 1495 " 1500 

Arg Val Leu Phe Glu Gin Pro Thr Leu Ala Ala Met Ala Gly Cys 
1505 1510 1515 

Leu Glu Ala Gin Ser Trp Ser Thr Gin Ala Pro His Pro Pro Ser 
1520 1525 1530 

Ala Ser lie Glu Glu Gly Glu Leu 
1535 1540 



<210> 12 
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^ 103 

<211> 3515 
<212> PRT 

<213> Angiococcus discif oralis 



<400> 12 

Met Thr Pro Ser Gly Asp Glu Ala Leu Gin Ser Ser lie Ala Leu Val 
1 5 10 15 



Gly Met Ala Gly Arg Phe Pro Gly Ala Pro Asp Val Glu Ser Phe Trp 
20 * 25 30* 



Arg Asn Leu Val Ala Gly Val Glu Ser lie Ser Phe Phe Ser Glu Glu 
35 40 45 



Glu Leu Arg Gin Ala Gly Val Ser Glu Gin lie Arg Arg Arg Pro Glu 
50 55 60 



Tyr Val Pro Ala Lys Gly Val Leu Glu Asp Leu Glu Leu Phe Asp Ala 
65 70 75 80 



Gly Phe Phe Gly Tyr Ser Pro Arg Glu Ala Ser His Leu Asp Pro Gin 
85 90 95 



Gin Arg Leu Leu Leu Glu Cys Ser Trp Glu Ala Leu Glu Asp Ala Gly' 
100 105 110 



Leu Arg Pro Asp Gin Leu Pro Gly Trp Val Gly Val Tyr Val Gly Ala 
115 120 125 



Gly Asp Thr Ser Tyr Arg Phe Gin Leu Leu Arg Gly His Gly Asp Pro 
130 135 140 



Leu Ser Gly Ser Lys Asp Val Ala Gly Phe Phe Gly Asn Tyr Pro Asp 
145 150 155 " 160 



Phe Leu Ala Thr Arg Val Ala Tyr Lys Leu Asn Leu Arg Gly Pro Ala 
165 170 175 



Leu Gly lie His Thr Ala Cys Ser Thr Ser Leu Val Ser lie Asn Met 
180 185 190 



Ala Cys Ser Ala Leu Arg Gly Phe Glu Cys Asp Met Ala Leu Ala Gly 
195 200 ' 205 



Gly Val Ser Leu Arg Leu Pro Ala Arg Ser Gly Tyr Leu Tyr Glu Glu 
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210 215 220 

Gly Gly Val Ala Ser Lys Asp Gly His Cys Arg Pro Phe Asp Ala Arg 
225 230 235 240 

Ala Thr Gly Thr Val Thr Gly Asp Gly Val Gly Val Val Val Leu Lys 
245 250 255 

Arg Leu Glu Asp Ala Leu Lys Ala Arg Asp Pro lie His Ala Val lie 
260 265 270 

Arg Gly Trp Ala Leu Asn Asn Asp Gly Ala Ser Arg Ala Gly Phe Thr 
275 280 285 

Ala Pro Ser Val Glu Gly Gin Ser Glu Val lie Ala Leu Ala His Ala 
290 295 300 

Ala Ala Gly lie Ser Ala Arg Asp lie Thr Tyr Val Glu Ala His Gly 
305 310 315 320 

Thr Gly Thr Pro Leu Gly Asp Pro lie Glu Val Ala Ala Leu Thr Arg 
325 330 335 

Ala Phe Arg Ala His Thr Ala Asp Thr Ala Phe Cys Thr Leu Gly Ala 
340 345 " 350 

Val Lys Ser Asn lie Gly His Leu Asp Ala Ala Ala Gly Val Ala Gly 
355 360 365 

Val He Lys Thr Val Gin Ala Leu Arg His Arg Leu He Pro Pro Thr 
370 375 380 

Leu His Phe Glu Arg Pro Asn Pro Ala Leu His Leu Glu Gin Ser Pro 
385 390 395 400 

Phe Phe Val Asn Thr Gin Pro Leu Pro Trp Glu Ser Pro Arg Gly Pro 
405 410 415 

-rq Leu Ala Gly Val Ser Ser Phe Gly He Gly Gly Thr Asn Ala His 
420 425 430 

Thr Leu Phe Glu Glu Ala Pro Pro Pro Pro Ala Ser Gly Pro Thr Arg 
435 440 445 



Gin Val Leu Leu Leu Ser Ala Arg Ser Thr Ser Ala Leu Glu 
4 50 455 460 
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His lie Ala Gly Arg Leu Ala Ala His Leu Arg Arg His Pro Asp Leu 

465 470 475 480 

Glu Leu Ala Asp Val Ala Phe Thr Leu Gin Val Gly Arg Ala Arg Phe 
485 490 * ^ 495 

Pro Tyr Arg Arg Ala Leu Thr Cys Arg Thr Leu Ala Glu Ala Met Glu 
500 505 510 

Arg Leu Glu Ala Pro Glu Pro Arg Pro Pro Glu Pro Leu Ala His Glu 
515 520 525 

Gly Glu Arg Pro Pro Leu Val Met Leu Phe Pro Gly Gin Gly Thr Pro 
530 535 540 

Leu Val Gly Thr Ala Arg Ala Leu His Glu Ser Glu Pro Thr Phe Arg 
545 550 555 560 

Gin Ala Val Glu Gin Cys Ala Arg Leu Leu Arg Gin Thr Leu Gly Leu 
565 570 " 575 

Asp Val Arg Glu Val Leu Phe Pro Ser Ala Glu Gin Glu Glu Gin Ala 
580 585 590 

Arg Arg Leu Ala Ala Gin Thr Arg Val Ala Gin Pro Ala Leu Phe Thr 
595 600 605 

Leu Glu Tyr Ala Leu Ala Gin Thr Trp Leu Gly Trp Gly Leu Gin Pro 
610 615 620 

Gin Ala Leu Ala Gly His Ser Leu Gly Glu Leu Val Ala Ala Cys Leu 
625 630 635 640 

Ala Gly Val Phe Ser Leu Glu Asp Ala Leu Gin Leu Val Ala Ala Arg 
645 650 655 

Gly Gin Leu Met Gin Gly Cys Pro Pro Gly Ala Met Leu Ala Val Pro 
660 665 670 

Leu Pro Glu Ala Glu Leu Ala Ala Leu Leu Gly Ser Glu Leu Cys He 
675 680 685 

Ala Ala Val Asn Gly Pro Arg Ala Cys Val Ala Ser Gly Pro Leu Pro 
690 695 700 



Ala Val Glu Ala Leu Thr Ala Ala Leu Glu Ser Arg Gly Val Ser Ser 
705 710 715 . 720 
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Arg Arg Leu Glu Thr Ser His Ala Phe His Ser Ala Ser Met Glu Ala 
725 730 ' 735 

Cys Gin Gly Pro Leu Thr Thr Leu Leu Arg Arg Met Arg Leu Gin Ala 
740 745 750 

Pro Arg Leu Pro Cys Val Ser Gly Leu Thr Gly Arg Trp Leu Thr Gly 
755 760 765 

Glu Glu Ala Thr Glu Pro Thr Tyr Trp Ala Arg Gin Leu Arg Glu Pro 
770 775 " 780 

Val Arg Phe Ser Glu Ala Leu Glu Thr Leu Trp Ser Leu Lys Glu Pro 
785 790 795 800 

Val Leu Leu Glu Val Gly Pro Gly Thr Thr Leu Thr Ala Leu Ala Arg 
805 810 815 

Arg His Pro Thr Arg Pro Ala Arg Thr Gin Glu Val Ala Ser Leu Pro 
820 825 830 

Val Gin Pro Asp Thr Ala Val Pro Cys He Glu Glu Ala Val Gly Glu 
835 840 845 

Leu Trp Gin Ala Gly Leu Glu Leu Asp Trp Ser Ala Leu His Ala Ala 
850 855 860 

Pro Arg His Arg Ala His Leu Pro Pro Tyr Pro Phe Glu Arg Gin Arg 
865 870 875 880 

Tyr Trp He Glu Pro Glu Ala Ala Pro Gin Pro Arg Ala Gin Gin Pro 
885 890 " 895 

Thr Pro Ala Ser Leu Val Pro Pro Glu Gin Pro Ser Arg Glu Ala Leu 
900 905 910 

Glu Asp Trp Phe Tyr Val Pro Thr Trp Glu Gin Ala Pro Ala Thr Ser 
915 920 925 

Gly Gly Gly Gin Pro Leu Ala Gly Pro Val Leu Ala Phe Met Asp Ser 
930 935 940 

Ser Gly Leu Ala Glu Gin Val Leu Ala Ala Leu Trp Pro Ala Asp Ser 
945 950 955 960 



ary Al* Leu Leu Thr Arg Val Glu Pro Ala Gly His Tyr Glu Gin Leu 
965 970 975 
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107 



Ser Glu His Ala Phe Arg Leu Arg Pro Glu Ser Glu Glu Asp Trp Asp 
980 985 990 



Ala Leu Phe Gin Ala Leu Gin Ser Gin Gly Arg Leu Pro Arg Arg lie 
995 1000 1005 



Leu His Ala Trp Ala Leu Thr Ala Glu Pro Gly Pro Cys Thr Pro 
1010 1015 1020 



Asp Gly Glu Ala Val Leu Glu Gin Gly Phe Phe Ser Leu' Leu Arg 
1025 1030 1035 



Leu Ala Arg Ala Leu Gly Arg His Ala Pro Glu Arg Pro Val Gin 
1040 1045 1050 



Leu Glu Val Leu Ser Ser Phe Val His Ala Val Gly Pro Arg Glu 
1055 1060 1065 



Pro Leu Glu Pro Leu Lys Ala Thr Leu Leu Gly Ala Cys Ala Val 
1070 ~ 1075 1080 



Leu Pro Leu Glu Tyr Pro His Val Gin Cys Arg Thr lie Asp Val 
1085 1090 1095 



Arg Pro Gly Ser Glu Pro Arg Glu Val Leu Val Arg Ser Leu Ala 
1100 1105 1110 



Ala Glu Leu Ala Ala Pro Met Gly Glu Ser Pro Val Ala Trp Arg 
1115 1120 1125 



Asp Gly Gin Arg Tyr Val Arg Arg Ala Thr Arg Gin Arg Leu Glu 
1130 1135 1140 



Ala Ser Arg Pro Leu Arg Ser Leu Arg Glu Arg Gly Val Tyr Leu 
1145 1150 1155 



Val Ala Gly Gly Leu Gly Gly lie Gly Leu Val Leu Ala Arg Ala 
1160 1165 1170 



Leu Ala Gin Arg Ala Arg Ala Arg Leu Ala Leu Leu Thr His Ser 
1175 1180 1185 



Pro Phe Pro Pro Arg Glu Gin Trp Glu Gin Trp Leu Glu Glu Ala 
1190 1195 1200 



Pro Ala His Pro Glu Pro Ala Trp Arg Ser Glu Ala Asp Pro Ser 
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1205 1210 1215 

Glu Arg Arg Arg Thr Gin His Arg lie Arg Cys Leu Leu Glu Leu 
1220 1225 1230 

Glu Gin Leu Gly Ala Glu Val Gin Val Tyr Thr Ala Asp Val Ala 
1235 1240 1245 

Glu Glu Ala Ala Val Arg Ser Val Val Glu Gin Val His Ala Arg 
1250 1255 1260 

Trp Gly Lys lie His Gly Val Leu His Ala Ala Ala Thr Phe Asp 
1265 ~ 1270 1275 

Asp Gly Val lie Gin Leu Arg Thr His Glu Gin Ser Ser Arg Ala 
1280 1285 1290 

Leu Arg Thr Lys Val Arg Gly Ser Met Val Leu His Glu Val Leu 
1295 1300 1305 

Ala Ser Glu Gly Leu Asp Trp Phe Ala Leu Cys Ser Ser Leu Ala 
1310 ~ 1315 1320 

Ser Ala Leu Gly Ser Phe Gly Gin Ala Asp Tyr Cys Ala Ala Asn 
1325 1330 1335 

Ala Phe Gin Asp Ala Tyr Ala His His Leu Arg Arg Gin Gly Phe 
1340 ' " 1345 1350 

Thr Gly Ala Leu Ala Leu Asp Trp Gly Thr Trp Arg Asp Thr Gly 
1355 1360 1365 

Ala Ala Met Arg Leu Val Ala Arg Thr Arg Arg Gly Gly His Glu 
1370 " 1375 1380 

Lys Pro Pro Thr Pro Leu Thr His Pro Leu Phe Asp Cys Glu Gin 
1385 1390 1395 

Arg Glu Pro Gly Gly Thr His Trp Leu Gly Leu Thr Leu Arg Gly 
1400 1405 1410 

Gly Glu Asp Trp Val Val Asp Glu His Arg Leu Gin Gly Val Pro 
1415 ~ " 1420 1425 



Thr Leu Pro Gly Val Ala Tyr Leu Glu Leu Ala Arg Ala Ala Cys 
1430 ~ 1435 1440 
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Ala Gin Ala Leu Gly Ala Glu Ala Val Glu Leu Ala Glu Leu Leu 
1445 1450 1455 

Leu Leu Glu Pro Leu Thr Val Pro Arg Gly Glu Ser Arg Gin Val 
1460 1465 1470 

Arg Val Val Leu Gin Pro Glu Gly Gin Ala His Ala Leu Arg Val 
1475 1480 1485 

Glu Ser Arg Ser Glu Glu Ala Arg Gly Trp Asn Glu His Ala Arg 
1490 1495 1500 

Gly Arg Val Arg Ala Val Pro Arg Leu Ala Glu Arg lie Gin Pro 
1505 1510 1515 

Glu Leu Leu Arg Ala Ala Cys Glu His Glu Gin Pro Val Pro Gly 
1520 1525 1530 

Glu Pro Gin Glu Gin Gly Pro Val His Ala Gly Ala Arg Trp His 
1535 1540 1 1545 

Gly Leu Phe Gin Trp Val Arg Arg Gly Pro Arg Gin Ala Leu Ala 
1550 1555 1560 

Gin Leu Ala Leu Pro Glu Pro Phe His Gly Asp Leu Glu Arg Phe 
1565 1570 " 1575 

Glu Leu His Pro Ala Leu Met Asp Met Ala Thr Ser Phe Ala lie 
1580 1585 1590 

Pro Gly Gly Val Pro Trp Leu Ala Phe Gly Tyr Glu Arg Val Leu 
1595 1600 ^ 1605 

lie His Gly Pro Leu Pro Pro Gin Val Leu Ser His Val Ser Leu 
1610 1615 1620 

Pro Glu Glu Ser Gin Ala Gly Ala Gin Gin Leu Arg Leu Gin Val 
1625 1630 1635 

Arg Leu Leu Asp Leu Glu Gly Trp Glu Arg Val Arg lie Asp Gly 
164 0 1645 1650 

Tyr Leu Leu Arg Pro Leu Lys Pro Ser Asp Ala Ser Val Glu Pro 
1655 1660 ~ 1665 



Ala Ala Pro Asn Val Glu Val Ala Val Gly Thr Pro Gly. Leu Leu 
1670 1675 1680 
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Glu Ser Leu Gly Leu Arg Arg Cys Thr Arg Pro Ala Pro* Gly Pro 
1685 1690 1695 

Arg Gin Val Glu lie Glu Val Glu Ala Ala Gly Leu Asn Phe Leu 
1700 1705 1710 

Asp Val Leu Gly Ala Leu Gly Met Met Pro Ala Leu Glu Ala Glu 
1715 1720 1725 

Glu Ser Val Leu Gly Arg Glu Cys Ser Gly Arg lie Ala Ala Val 
1730 1735 1740 

Gly Glu Gly Val Ser Gly Leu Arg Val Gly Asp Glu Val Leu Ala 
1745 1750 ** 1755 

Val Ala Pro Gly Cys Phe Arg Ser Tyr Val Leu Val Asp Glu Ser 
1760 1765 1770 

Gin Val Val Arg Arg Pro Ala Ser Leu Gly Leu Ala Glu Gly Ala 
1775 1780 1785 

Ala Gin Met Val Pro Phe Ala Thr Ala Tyr Phe Ala Leu His Thr 
1790 1795 1800 

Val Gly Arg Leu Arg Arg Gly Glu Arg He Leu He His Ala Ala 
1805 1810 1815 

Ala Gly Gly Leu Gly Leu Ala Ala Val Gin Leu Ala Ser Arg Thr 
1820 1825 1830 

Gly Ala Glu lie Leu Ala Thr Ala Gly Ser Glu Gin Lys Arg Glu 
1835 1840 1845 

Tyr Leu Arg Ser Leu Gly He Ala His Val Leu Asp Ser Arg Ser 
1850 1855 1860 

Thr Ser Phe Val Ser Glu Val Arg Glu Arg Thr Gly Gly Arg Gly 
1865 1870 1875 

Veil Asp Val Val Leu Asn Ser Leu Ala Gly Glu Leu Leu Leu Ala 
1880 1885 1890 

i4 lv j-.«u Ser Val Leu Ala Pro His Gly Arg Phe Leu Glu Leu Gly 
1235 1900 1905 



;,vs Arc? Asp Leu Tyr Ala Asp Gin Gin Val Gly Leu Arg Thr Leu 
•i.310 1915 1920 
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Ala Arg Gly Gin Thr Phe Ala Ala lie Asp Phe Gly Pro His His 
1925 1930 1935 



Pro Asp Phe Arg Ala Val Leu Glu Glu Val Ala Thr Gin Leu Thr 
1940 1945 1950 



Gin Gly Gin Leu Glu Pro Leu Pro Thr Arg Leu Phe Pro Ala Arg 
1955 1960 1965 



Gin Val Ala Glu Ala Phe Ser Phe Met Ala Arg Ala Leu His He 
1970 1975 1980 



Gly Arg Val Ala Val Ser Met Gin Gly Ala Thr Ala Leu Pro Ala 
1985 1990 1995 



Ser Met Thr Arg Gly Ser Arg Pro Ala Pro Val Ala Val Pro Pro 
2000 2005 2010 



Trp Glu Asp Pro Arg Leu Ala Gly Gly He Ser Ser Glu Glu Gly 
2015 2020 2025 



Ala Glu Ala Phe Leu Arg Ala Leu Glu Gin Gly Ala Pro Gin Leu 
2030 2035 2040 



He He Ser Pro Gin Asp Phe Ser Ser Leu Leu Arg Gly Leu Gly 
2045 2050 2055 



Gly Ser Gin Gly Val Arg Glu Lys Glu Arg Leu Val Thr Gly Arg 
2060 2065 2070 



Ala Ala Ala Ala Glu Pro Gin Ala Leu Pro Pro Ser Ser Leu Glu 
2075 2'080 2085 



Gin Leu He Glu Gin Val Trp Arg Lys His Leu Gly Val Glu Arg 
2090 2095 2100 



Val Gin Pro Thr Asp Ser Phe Phe Gin Leu Gly Gly Asp Ser Leu 
2105 2110 2115 



Leu Gly lie Gin Val Ala Ala Asp Leu Arg Arg His Leu Gly Val 
2120 2125 2130 



Glu Leu Pro Thr Ala Thr Leu Phe Ser His Pro Thr Leu Ala Ala 
2135 2140 2145 



Leu Ala Ala Ala Leu Arg Ala Arg Gin Gly Glu Ala Ala Ala Pro 
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2150 2155 2160 

Thr Ala Pro Ala Pro Ala Leu Val Pro Asp Pro Ala Ala Arg Phe 
2165 2170 2175 

Glu Pro Phe Pro Leu Thr Asp Val Gin Glu Ala Tyr Trp Val Gly 
2180 2185 2190 

Arg Arg Ser Ala Phe Glu Leu Gly Gly Val Ala Ala His Gly Tyr 
2195 2200 2205 

Phe Glu lie Glu Ser Pro Gly Leu Glu Val Glu Arg Phe lie Gin 
2210 2215 2220 

Cys Trp Arg Gin Leu Leu Gin Arg His Asp Met Leu Arg Met Val 
2225 2230 2235 

Val Leu Pro Asp Gly Arg Gin Gin Val Leu Glu Gin Val Pro Glu 
2240 2245 2250 

Tyr Thr Pro Glu Val Val Glu Leu Arg Gly Leu Ser Pro Gin Glu 
2255 2260 2265 

Ala Glu Ser Arg Arg Leu Gin Leu Arg Glu Arg Met Ala His Gin 
2270 2275 2280 

Val Leu Arg Ser Asp Arg Trp Pro Leu Phe Glu Leu Val Leu Cys 
2285 2290 2295 

Arg Tyr Glu Gly Gly Val Arg lie His Met Ser Met Asp Ala Leu 
2300 2305 2310 

Met Leu Asp Ala Trp Ser Ser Ala Val Leu Arg Gin Asp Phe Ala 
2315 2320 * 2325 

Gin Leu Tyr His Glu Pro Gly Arg Pro Leu Glu Pro Leu Ala lie 
2330 . 2335 2340 

Thr Phe Arg Asp Tyr Val Leu Ala Glu Arg Arg Leu Arg Glu Gly 
2345 2350 2355 

Glu Ala His Glu Arg Ala Arg Ala Tyr Trp Trp Ala Arg Leu Asp 
2360 2365 2370 



Thr Leu Pro Pro Pro Pro Glu Leu Pro Leu Val Lys Glu. Pro Ser 
2375 2380 2385 
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Gin Leu Glu His Ala Arg Phe Thr His Arg Glu Ala Arg.Leu Glu 
2390 2395 2400 

Pro His Arg Trp Ala Arg Leu Gin Glu Arg Ala Arg Ala His Gly 
2405 2410 2415 

Leu Thr Pro Ser Ala Ala Cys Met Ala Ala Phe Ala Glu Val Leu 
2420 2425 2430 

Ala Arg Trp Ser Arg His Pro Arg Phe Thr Leu Asn Leu Thr Leu 
2435 2440 2445 

Phe Gin Arg Leu Pro Leu His Pro Gin Val Asp Glu Leu Val Gly 
2450 2455 2460 

Asp Phe Thr Ser Leu Val Leu Leu Glu Val Glu Ala His Ala Ala 
2465 2470 2475 

Ser Thr Phe Ala Glu Arg Ala Ser Arg Leu Gin Ala Gin 'Leu Trp 
2480 2485 2490 

Arg Asp Leu Glu His Gly Ser Val Ser Ala Val Gin Leu lie Arg 
2495 2500 2505 

Glu Leu Val Arg Thr Gly Arg Arg Ser Pro Gly Ala lie Met Pro 
2510 2515 2520 

Val Val Phe Thr Ser lie Leu Ser Leu Asp Ala Arg Arg Gly Pro 
2525 2530 ~ 2535 

Gin Gly Ser Leu Ser Phe Phe Glu Gly Glu Leu Val Tyr Ser lie 
2540 2545 2550 

Ser Gin Thr Pro Gin Val Trp Leu Asp His Gly Val His Glu Glu 
2555 2560 2565 

Glu Gly Ala Leu Val Leu Ala Trp Asp Ser Val Glu Ala Leu Phe 
2570 2575 2580 

Pro Pro Gly Met Val Asp Asp Met Phe His Ala Tyr Gin Arg Leu 
2585 2590 2595 

Leu Gly Ala Leu Ala Glu Glu Glu Gin Ala Trp Glu Gly Glu Leu 
2600 2605 2610 



Pro Glu Leu Leu Pro Pro Ala Gin Arg Glu Leu Leu Ala Arg Tyr 
2615 2620 2625 
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Asn Ala Thr Gin Ala Pro Arg Pro Ser Gly Arg Leu Glu Glu Gly 
2630 2635 2640 

Phe Phe Thr Gin Ala Arg Leu His Pro Glu Leu Pro Ala Leu Leu 
2645 2650 2655 

Ala Pro Glu Arg Thr Leu Ser Tyr Gly Glu Leu Ala Arg Arg Ala 
2660 2665 2670 

Gin Ala Leu Ala Ala Arg Leu Arg Glu Leu Glu Val Gin Pro Gin 
2675 2680 2685 

Glu Leu Val Ala lie Ala Met His Lys Gly Trp Glu Gin Ala Thr 
2690 2695 2700 

Ala Val Leu Gly Val Leu Gin Ala Ala Ala Ala Tyr Leu Pro Leu 
2705 2710 2715 

Asp Pro Glu Gin Pro Pro Leu Arg Leu His Gin Leu Leu Glu Glu 
2720 2725 2730 

Gly Pro Ala Arg Val Val Leu Thr Gin Ser Ser Leu Leu His Thr 
2735 2740 2745 

Val Pro Trp Pro Pro Gly Val Gin Val lie Ala Val Asp Glu Leu 
2750 2755 2760 

Glu Pro Ala Thr Glu Ala Pro Pro Leu Pro Pro Arg Gly Thr Pro 
2765 2770 2775 

Glu His Leu Ala Tyr Val lie Tyr Thr Ser Gly Ser Thr Gly Lys 
2780 2.785 2790 

Pro Lys Gly Val Ala lie Glu His Arg Ala Ala Leu Asn Thr Val 
2795 2800 2805 

Val Asp Leu Asn Thr Arg Phe Gly Val Gly Pro Glu Asp Arg Val 
2810 2815 2820 

Leu Gly Leu Ser Ala Leu Thr Phe Asp Leu Ser Val Tyr Asp Val 
2825 2830 2835 

Leu Gly Leu Leu Gly Ala Gly Gly Ala Leu Val Leu Pro Ala Ala 
2840 2845 2850 



Glu Ala Glu Lys Asp Pro Ala His Trp Trp Glu Arg Leu Val Ala 
2855 2860 2865 



WO 2004/022586 M W PCT/EP2003/009780 

115 

Gly Arg Val Thr Val Trp Asn Ser Thr Pro Ala Leu Met Leu Leu 
2870 2875 2880 

Leu Val Glu Tyr Ala Glu Gin Arg Gly Leu Lys Leu Pro Ala Ala 
2885 2890 2895 

Leu Arg Leu Val Met Leu Ser Gly Asp Trp lie Pro Val Ala Leu 
2900 2905 2910 

Pro Asp Arg lie Arg Ala Leu Gly Arg Asp Val Gin Val Val Ser 
2915 2920 2925 

Leu Gly Gly Ala Thr Glu Ala Ser lie Trp Ser lie Ala Tyr Pro 
2930 2935 2940 

lie Gly Gin Val Ala Pro Gin Trp Lys Ser lie Pro Tyr Gly Met 
2945 2950 2955 

Pro Leu Ala Asn Gin Arg Phe His Val Leu Asp Gly Arg Leu Glu 
2960 2965 2970 

Ala Arg Pro Trp Trp Val Pro Gly Glu Leu Tyr He Gly Gly Glu 
2975 2980 2985 

Gly Leu Ala Arg Glu Tyr Trp Arg Asp Glu Pro Leu Thr Ala Thr 
2990 2995 3000 

Arg Phe He Arg His Pro Arg Thr Gly Glu Arg Leu Tyr Arg Thr 
3005 3010 3015 

Gly Asp Gin Gly Arg Met Leu Pro Glu Gly Ser He Glu Phe Leu 
3020 3025 3030 

Gly Arg Glu Asp Leu Gin Val Lys Val Gin Gly Phe Arg Val Glu 
3035 3040 3045 

Leu Gly Glu He Glu Ala Ala Leu Ala Gin His Pro Ala Leu Ser 
3050 3055 3060 

Ala Ser Val Val Val Ala Arg Gly Glu Pro Arg Gly Val Arg Arg 
3065 3070 3075 

Leu Val Ala Tyr Ala Val Pro Arg Ser Gly Gin Thr Pro Ala Ala 
3080 3085 3090 



Gly Glu Leu Arg Arg Tyr Leu Ala Glu Arg Leu Pro Ala Tyr Met 
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3095 3100 3105 

Val Pro Ser Ala Phe Val Leu Leu Glu Ser Leu Pro Arg Ser Arg 
3110 3115 3120 

Asn Gly Lys He Ala Arg Asp Gin Leu Pro Glu Pro Gin Gin Thr 
3125 3130 3135 

Gin Gly Leu Ala Ala Gin Ala Ala Ala Ala Asp Pro Leu "Val Glu 
3140 3145 3150 

Arg Leu Ala Ala Leu Val Lys Glu Ala Leu Arg Leu Glu Arg Val 
3155 3160 3165 

Glu Pro Gin Asp Ser Leu Leu Asp Leu Gly Ala Asp Ser Val Ala 
3170 3175 3180 

Leu He Arg Leu He Asn Arg Leu Glu Ala Glu Leu Gin Phe Arg 
3185 " 3190 3195 

Pro Arg Leu Ala Asp He Tyr Glu Asn Pro Thr Val Gin Gly Leu 
3200 3205 3210 

Ala Thr Leu His Gin Glu Lys Thr Lys Ser Gin Gly Glu Gly Gly 
3215 3220 3225 

Ala Pro Arg Leu Thr Ala Pro Arg Ser Thr Leu Leu Pro Ala Glu 
3230 3235 3240 

Glu Trp Gly Arg Phe Lys Ala Asn Arg Pro Gly Leu Arg Arg Phe 
3245 3250 3255 

Pro Asp Gly Thr Pro Glu Val Ala Leu Pro Gly Ser Gly Leu Ala 
3260 " 3265 3270 

Pro Ala Pro Glu Glu Leu Thr Ala Leu Glu Arg Arg Arg Ser Val 
3275 3280 3285 



-~ T*r Tyr Ser Leu Glu Pro Val Ser His Glu Gin Leu Gly Arg 
" 3290 3295 3300 



Leu Leu Ala Pro Leu Arg Glu Trp Glu Val Gin Gly Ser Arg Arg 
2205 3310 3315 

Tvr Leu Tyr Ala Ser Ala Gly Gly Leu Tyr Pro Val Gin Leu Tyr 
3320 3325 3330 
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Leu His Leu Lys Pro Gly Arg Ala Arg Gly Leu Glu Pro Gly Thr 
3335 3340 3345 

Trp Tyr Tyr Asp Pro Ser Thr His Arg Leu Val Leu Leu Ser Ala 
3350 3355 3360 

Gly Ala Gly Leu Asp Arg Arg lie His Asp Pro His Gin Asn Gin 
3365 3370 3375 

Ala lie Phe Asp Ser Ala Ala Phe Ser Leu Phe Leu lie Ala Arg 
3380 3385 3390 

Met Gly Ala Val Glu Pro Val Tyr Ala Glu His Ala Leu His Phe 
3395 3400 3405 

Ala Thr Leu Glu Ala Gly Leu Met Thr Gin Leu Leu Asp Leu Gly 
3410 3415 3420 

Ala Ala Pro Ser Gly Leu Gly Leu Cys His lie Gly Asp Leu Asp 
3425 3430 3435 

Phe Ala Gin Ala Arg Gly Leu Phe His Leu Glu Glu Glu His Val 
3440 3445 3450 

Leu Leu His Ser Leu Val Gly Gly Val Leu Pro Thr Arg Gly Gin 
3455 3460 " 3465 

Glu Ala Ala Ser Val Pro Ala Glu Gly Gly Thr Glu Ala Arg Gin 
3470 3475 ~ 3480 

Leu Ala Gin Leu Leu Gin Gin Val Lys Thr Leu Thr Pro Glu Ala 
3485 3490 3495 

Ala Arg Ala Leu Leu Glu Ala Arg Arg Gly Ser Lys Gly Arg Pro 
3500 3505 ~ 3510 

His Glu 
3515 

<210> 13 

<211> 2837 

<212> PRT 

<213> Angiococcus discif onais 



<400> 13 
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Met Ser His Asp Thr Gin Leu Glu Gly Ala Val Ala Val Val Ala Met 
15 10 I 5 

Ala Gly Arg Phe Pro Gin Ala Pro Thr Leu Glu Asp Tyr Trp Arg He 
20 25 30 

He His Glu Gly Val Asp Ala Phe Thr Gin Leu Thr Asp Glu Gin Leu 
35 ~ l 4 0 45 

Leu Ala Ser Gly Val Gly Pro Ser Leu Leu Arg Gin Pro Gly Tyr Val 
50 ~ 55 60 

Arg Arg Ala Ala Leu Leu Asp Arg Met Asp Gin Phe Asp Ala Ala Phe 
65 70 75 80 

Phe Gly Phe Ser Pro Arg Glu Ala Glu Val Leu Asp Pro Gin His Arg 
85 90 95 

Leu Phe Leu Glu Cys Ala His Glu Ala Leu Glu. Arg Ala Gly His Gly 
100 105 HO 

Ser Glu Arg Ala Arg Gly Arg Val Gly Val Phe Ala Ser Ala Ser Leu 
115 " 120 125 

Asn Ser Tyr Tyr Leu His Ser Leu His Gly Asn Ala Arg Leu Arg Glu 
130 135 140 

Val Leu Gly Asp Phe Gin Leu Ala He Ala Asn Asp Lys Asp Phe Leu 
145 150 155 160 

Pro Thr Arg Val Ser Tyr Lys Leu Gly Leu Arg Gly Pro Ser Val Ala 
165 170 175 

Val Gin Thr Ala Cys Ser Ser Ser Leu Val Ala Val His Leu Ala Cys 
180 185 190 

Gin Ser Leu Leu Asn Gly Glu Cys Ala Leu Ala Leu Ala Gly Gly Ser 
195 200 205 

Ser Leu Ser Val Pro Gin Ala Gin Gly Tyr Leu Tyr Gin Glu Gly Gly 
210 215 220 

He Ala Ser Pro Asp Gly Tyr Cys Arg Pro Phe Asp Ala Ala Ala Ala 
225 230 235 240 



Glv Thr Asn Arg Gly Asn Gly Val Gly Val Val Leu Leu Lys Leu Leu 
245 250 255 
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Glu Glu Ala Leu Ala Asp Gly Asp Thr lie His Ala Val lie Arg Gly 
260 265 270 

Ser Ala Val Asn Asn Asp Gly Ala His Lys lie Gly Tyr Thr Ala Pro 
275 280 285 

Ser Val Glu Gly Gin Ala Ser Val lie Ser Glu Ala Leu Glu Val Ala 
290 295 300 

Ala Val Ser Ala Asp Ser lie Gly Tyr Val Glu Ala His Gly Thr Ala 
305 310 315 320 

Thr Ala Leu Gly Asp Pro lie Glu Val Gin Ala Leu Thr Arg Ala Phe 
325 330 335 

Arg Lys His Thr Glu Arg Arg Gly Tyr Cys Ala Leu Gly Ser Val Lys 
340 345 350 

Ala Asn lie Gly His Leu Asp Ala Ala Ala Gly lie Ala Ser Leu lie 
355 360 365 

Lys Ala Val Leu Val Leu Glu Arg Arg Gin Leu Pro Pro Cys Pro His 
370 375 380 

Phe Thr Ser Pro Asn Pro Arg lie Asp Phe Glu Arg Ser Pro Phe Tyr 
385 390 395 400 

Val Ser Gly Arg Gly Gin Pro Trp Glu Pro Val Asp Gly Pro Arg Arg 
405 410 415 

Ala Gly Val Ser Ser Phe Gly lie Gly Gly Thr Asn Val His Val Val 
420 425 430 

Leu Glu Glu Ala Pro Pro Arg Pro Ala Pro Arg Arg Ala Ala Arg Pro 
435 440 445 

Tyr Gin Leu Leu Pro Leu Ser Ala Arg Thr Glu Arg Ala Arg Glu Glu 
450 455 460 

Val Glu Ala Arg Leu Arg Glu His Leu Arg Gin Gin Pro Gin Glu Pro 
465 470 475 480 

Leu Glu Glu Val Ala His Thr Leu Gin Val Gly Arg Arg His Met Ala 
485 490 495 



Trp Arg Thr Ala Leu Val Ser Ser Ser Pro Ala Gin Ala Val Glu Leu 



WO 2004/022586 PCT/EP2003/009780 

W 120 

500 505 510 

Leu Glu Arg Arg Arg Pro Glu Ala Leu Leu Arg Gly Gin Ser Ala Ala 
515 520 525 

Gin Ala Arg Ser Val Ala Phe Leu Leu Pro Gly Gin Gly Ser Gin Tyr 
530 535 540 

i 

Val Gly Met Gly Ala Ala Leu His Glu Ser Glu Ala Pro Phe Arg Glu 
545 550 555 560 

Gin Val Asp Leu Cys Ala Gly Lys Leu Gin Gly Val Leu Gly Leu Asp 
565 ~ 570 575 

Val Arg Arg Leu Leu Tyr Pro Gly Pro Arg Glu Arg Glu Trp Ala Gin 
580 585 590 

Glu Arg Leu Arg Glu Thr Arg Val Cys Gin Pro Val Leu Phe Thr Val 
595 600 605 

Glu Tyr Ala Leu Ala Arg Leu Leu Glu Ala Trp Gly He Arg Pro Thr 
610 615 620 

Ala Leu Leu Gly His Ser Leu Gly Glu Tyr Val Ala Ala Cys Leu Ala 
625 630 635 640 

Gly Val Phe Ser Leu Glu Glu Ala Leu Glu Val Val Ala Ala Arg Gly 
645 650 655 

Gin Leu Met Gly Ser Met Pro Pro Gly Gly Met Leu Ala Val Gly Leu 
660 665 670 

Pro Ala Gin Glu Val Glu Pro Leu Leu Pro Ala Ala Leu Ala Leu Ala 
675 680 685 

Ala His Asn Ser Pro Gin Ala Cys Val Val Ala Gly Pro Thr Glu Pro 
690 695 700 

Leu Ala Glu Leu Arg Ala Arg Leu Glu Gin Arg Gly He Ala Cys Thr 
705 710 715 720 

Pro Leu Ala Val Ser His Ala Phe His Ser Pro Met Met Glu Pro Ala 
725 730 735 



Val Ala Pro Phe Val Ala Arg Leu Arg Arg Met His Leu Arg Pro Pro 
740 745 750 
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Ser Leu Pro Phe lie Ser Asn Val Thr Gly Thr Trp lie Glu Ala Glu 
755 760 "* 765 

Glu Ala Thr Ser Pro Glu Tyr Trp Gly Arg His Leu Leu Gin Pro Val 
770 775 780 

Arg Phe Ala Gin Gly Leu Glu Arg Leu Cys Glu Gly Val Gin Pro Arg 
78 5 790 " 795 - 800 

Leu Leu Leu Glu Val Gly Pro Gly His Thr Leu Gly Arg Leu Ala Ala 
805 810 ^ 815 

Arg Gin Ser Ser Gly Pro Val Gin Val Val Ser Thr Leu Gly Ser Ser 
820 825 830 

Arg Glu Glu Ser Ser Glu Val Glu Arg Leu Leu Thr Ala lie Gly Arg 
835 840 845 

Leu Trp Val Glu Gly Ala Glu Val Asp Trp Ala Gly Leu His Arg Gly 
850 ■ 855 860 

Glu Arg Arg Arg Arg Ala Val Leu Pro Thr Tyr Pro Phe Glu His Gin 
865 870 875 880 

Arg Tyr Trp Val Glu Thr Ala Pro Gin Pro Ser Pro Pro Glu Arg Pro 
885 890 895 

Gly Thr Pro Ala Glu Ser Pro Val Thr Ser Ser Phe Tyr Val Pro Gly 
900 905 910 

Trp Ser Arg Ala Ala Leu Pro Ser Ala Ala Pro Ser Arg Arg Val Gly 
915 920 925 

Arg Leu Leu Leu Leu Ala Glu Ala His Gly Trp Ser Gin Gly Leu Ala 
930 935 940 

Glu Arg Leu Arg Ala Arg Gly His Ser Val Thr Leu Val Glu Pro Gly 
945 950 955 960 

Glu Arg Leu Glu Arg Leu Thr Pro Glu His Trp Arg Leu Pro Pro Gly 
965 970 " 975 

Arg Arg Glu Asp phe Gin Arg Leu Leu Glu Asp Ser Gly Glu Leu Pro 
980 985 990 

• 

Ser Arg Val Leu His Leu Trp Leu Leu Gly Ser Ser Gin Ser Pro Gin 
995 1000 1005 
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Glu Arg Gly Phe Tyr Thr Leu Leu Ala Leu Ala Gin Ala Leu Gly 
1010 1015 1020 

Ala His Gly Thr Arg Pro Ala Val Glu Leu Thr Val Val Thr Asp 
1025 1030 1035 

Gin Leu Tyr Ala Val Ala Glu Gly Glu Pro lie Gin Pro- Leu Lys 
1040 1045 1050 

Ala Leu Leu Gin Gly Pro Ala Ser Val Leu Pro Gin Glu Leu Pro 
1055 1060 1065 

Gly Cys Thr Cys Arg Leu Val Asp Met Ala Leu Pro Pro Gly Gly 
1070 1075 1080 

Val Ala Glu Glu Leu Leu Glu Arg Leu Val Ala Glu Val Glu Ser 
1085 1090 " 1095 

Thr Ala Ser Glu Arg Ser Val Ala Tyr Arg Gly Ala Ala Arg Trp 
1100 1105 1110 

Val Arg Glu Phe Val Pro Val Pro Leu Pro Pro Pro Glu Pro Ser 
1115 1120 1125 

Gin Leu Pro Leu Arg Gin Arg Gly Val Tyr Leu lie Val 'Gly Gly 
1130 1135 1140 

Leu Gly Gly Val Gly Leu Ala Leu Ala Glu His Leu Ala Arg Arg 
1145 1150 1155 

Val Ser Ala Arg Leu Val Leu Thr Gly Arg Ser Pro Thr Pro Pro 
1160 1165 1170 

Arg Slu Ser Trp Ser Ala Trp Leu Gly Thr Pro Thr Arg Leu Arg 
1175 1180 1185 

Leu Ser Gin Glu Leu Glu Trp Leu Arg Gly Ala Ala Glu Gin lie 
1190 1195 1200 

Glu His Gin Arg Pro Leu His Ser Leu Ala Asp Ser Pro Gly Leu 
1205 1210 1215 

Glu Giu Ser Leu Arg Leu Leu Cys Ala Ser Tyr Leu Tyr His Phe 
U*2C ^ 1225 1230 



7.*m Ph^ Pro Leu Gin Ala Pro Leu Arg Leu Gly Glu Pro Arg Ala 
1235 1240 1245 
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Met Gin Ala Leu Arg Glu Arg Leu Gly Leu Gin Pro Gly Phe Glu 
1250 1255 1260 

Arg Leu Phe Ser Phe Met He Gly Thr Leu Glu Gin Ala Lys Leu 
1265 1270 1275 

He Arg Val Gly Glu Gly Met Leu Glu Gly Gin Val Glu Pro Ala 
1280 1285 1290 

His Val Pro Thr Pro Arg Ala Leu His Glu Arg Leu Leu Glu Gly 
1295 1300 1305 

Tyr Pro Glu Val Ser Gly Leu Leu Glu Leu Leu Glu His Cys Val 
1310 1315 1320 

Arg His Tyr Pro Glu Val Leu Arg Gly Ser Leu Pro Ala Leu Ser 
1325 1330 1335 

Val Leu Tyr Pro Ser Gly Arg Ser Glu Glu Asp Ser Ala Arg Ser 
1340 1345 1350 

Gly Val Glu Trp Ser Ser Met Gly Gin Cys Val Ala Leu Leu Arg 
1355 " 1360 1365 

His Phe Leu Ala Arg His Ala Glu Arg Thr Gin Gly Arg Thr Leu 
1370 1375 1380 

Arg He Leu Glu Val Gly Gly Gly Ser Gly Val Val Leu Gin Ala 
1385 1390 1395 

Leu Leu Pro Leu Leu Arg His His Pro Val Glu Tyr His Phe Thr 
1400 1405 1410 

Asp He Gly Pro Ser Phe Val Arg Ala Met Glu Gly Val Gly Arg 
1415 ^ 1420 1425 

Arg Gin Gly Leu Thr Phe Leu His Thr Ser Val Leu Asp He Ser 
1430 " 1435 1440 

Arg Pro Pro Pro Glu Gin Gly His Pro Pro Gly Ser Tyr Asp Leu 
1445 1450 1455 

Val He Ala Leu Asn Val Val His Ala Thr Pro Arg Val Pro Gin 
1460 1465 1470 



Ser Leu Ala His Leu Glu Ser Leu Leu Val Pro Gly Gly His Leu 
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1475 1480 1485 

Cys Leu Val Glu Thr Val Lys Gin Gin Pro Trp Val Asp Met He 
1490 1495 1500 

Trp Gly Leu Ala Glu Gly Trp Trp Ser Tyr Glu Asp Glu Leu Arg 
1505 1510 1515 

Thr Arg Ser Pro Leu Leu Glu Val Gly Asp Trp Glu His Ala Leu 
1520 1525 1530 

Arg Asp Val Gly Phe Ala Glu Val Glu Val Leu Pro Ala Ala Val 
1535 1540 1545 

Glu Gin Arg Ser Arg Trp Asp Asn Val Leu Leu He Ala Gin Arg 
1550 " 1555 1560 

Pro Gly Glu Ser Gly Leu Gin Pro Val Gly Gly Arg Ala Ala Met 
1565 1570 1575 

Gin Glu Arg He Arg Arg Leu Arg Ala He Glu Glu Ala Gly Gly 
1580 " 1585 1590 

Glu Val Leu Pro Leu Val Ala Asp Val Thr Asp Arg Glu Arg Met 
1595 1600 1605 

Gly Glu Val Leu Ala Glu Val Lys Arg Arg His Gly Ala Leu His 
1610 1615 1620 

Gly Val He His Ala Ala Leu Val Leu Glu Asp Gly Leu Met Gin 
1625 1630 1635 

Leu Lys Thr Arg Glu Ser Ala Gly Arg Val Leu Ala Ser Lys Val 
1640 " 1645 1650 

Glu Gly Thr Leu Val Leu Asp Glu Leu Leu Arg Asp Glu Pro Leu 
1655 1660 1665 

Asd Phe Phe Val Leu Cys Ser Ser Leu Ser Ala Leu Leu Gly Ala 
1670 1675 1680 

Leu Gly Gin Ala Asp Tyr Ala Ala Ala Ser Ala Phe Leu Asp Ala 
i£Q5 1690 1695 



Tyr Ala His Ser Gin Arg Gly Arg Thr Asp Arg Arg Thr He Ser 
1700 1705 1 71 0 
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Met Asp Trp Asp Arg Trp Leu Glu Val Gly Ala Ala Met Arg Leu 
1715 " 1720 1725 

Gly Leu Gly Leu Ala Ala Gly Ala Leu Gly Leu Gin Arg Thr Ala 
1730 1735 1740 

Pro Gly Glu Tyr Thr Val Arg Trp Gin Ala Glu Arg Cys Trp Trp 
1745 1750 1755 

Leu Asp Glu His Arg Leu Glu Gly Arg Ala Thr Leu Pro Gly Val 
1760 1765 1770 

Ala Tyr Leu Glu Leu Val Arg Ala Ala Leu Val Gin Glu Leu Gly 
1775 1780 1785 

Glu Ala Pro Val Glu Leu Glu Gin Leu Val Leu Leu Ser -Leu Leu 
1790 1795 1800 

Glu Ala Pro Ala Gly Glu Glu Val Glu Val Arg Phe His Leu Arg 
1805 1810 1815 

Pro Glu Asp Glu Gly Tyr Ala Leu Glu lie Arg Ser Arg Ala Gly 
1820 1825 1830 

Gly Leu Ala Asn Gly Gly Trp Arg Ser His Ala Met Gly Arg Val 
1835 1840 1845 

Arg Met Leu Pro Arg Gly Ser Ala Arg Pro Thr His Pro Leu Arg 
1850 1855 1860 

Glu Leu Glu Glu Arg Leu Gly Leu Thr Gin Ala Pro Arg Glu His 
1865 1870 1875 

Glu Pro Ala Gin Gly Pro Arg Glu Leu Ala Pro Ala Leu Gly Arg 
1880 1885 1890 

Arg Trp Ser Ser Leu Ser Trp His Arg Ala Trp Lys Gly Glu Glu 
1895 1900 1905 

Gly Leu Ala Leu lie Glu Leu Pro Glu Glu Leu Ala Glu Asp Leu 
1910 1915 1920 

Arg Gin Trp Pro Leu His Pro Ala Leu Leu Asp Ala Ala Thr Gly 
1925 1930 1935 



phe Ala Pro Leu Pro Pro Gly Ala Trp Leu Pro Leu Ser Tyr Gly 
1940 1945 1950 
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Asn Thr Arg He His Gly Pro Leu Pro Arg Gin Leu Tyr Ser His 
1955 I960 1965 

He Arg Arg Leu Glu Pro Ala Ser Ala Gin Ala Gly Val Val Arg 
1970 " 1975 1980 

Leu Glu Val Arg Leu Met Asp Gly Glu Gly Arg Glu Leu Leu Cys 
1985 " 1990 1995 

Val Glu Glu Phe Val Leu Arg Arg Val Glu Val Asp Ser Leu Ala 
2000 2005 2010 

Arg Pro Gin Pro Ala Gly Lys Ala Pro Arg Gin Ala Val Leu Pro 
2015 2020 2025 

Arg Pro Gly Ala Leu Asp Ser Leu Arg Leu Gin Pro Leu Glu Arg 
2030 2035 2040 

Leu Pro Pro Gin Glu Gly Glu Val Glu Val Gin Val Leu Ala Ala 
2045 2050 2055 

Gly Leu Asn Phe Lys Asp Ala Leu Leu Ala Leu Gly Ala Leu Pro 
2060 2065 2070 

Val Glu Leu Ala Asn Gly Ala Pro Leu Ala Leu Gly Val Glu Cys 
2075 2080 2085 

Ala Gly He He Ser Ala Val Gly Pro Gly Val Arg Gly Leu Arg 
2090 2095 2100 

Val Gly Glu Ala Val Val Ala Ala Ala Ala Gly Ala Phe Ala Ser 
2105 2110 2115 

His Val Arg Val Pro Gin Glu Gin Val Phe Pro Lys Pro Ala Gly 
2120 " 2125 2130 

Leu Ser Phe Glu Gin Ala Ala Met Val Pro Val Thr Leu Phe Thr 
2135 2140 2145 

Ala Trp Tyr Ala Leu Glu Glu Leu Ala Arg Leu Arg Ala Gly Glu 
2150 2155 2160 

His Val Leu He His Ala Ala Ala Thr Gly Val Gly Leu Ala Gly 
2165 2170 2175 



Val Lys Leu Ala Leu Arg Arg Gly Ala Thr Val Tyr Ala Thr Ala 
2180 2185 2190 
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Gly Ser Glu Pro Lys Arg Glu Leu Leu Arg Ser Leu Gly Val Thr 
2195 2200 2205 

Leu Ala Met Asp Ser Arg Ala Pro Gly Phe Asp Asp Gin Leu Leu 
2210 2215 2220 

Gin His Thr Gin Gly Arg Gly Val Asp Val Val Leu Asn Ser Leu 
2225 2230 2235 

Ser Gly Glu Phe Leu Ser Arg Ser Leu Gly Val Leu Ala Arg His 
2240 2245 2250 

Gly Arg Phe Val Glu Leu Gly Val Arg Asp Met Leu Ser Gly Gly 
2255 2260 2265 

Thr Leu Pro Leu Ala Pro Phe Glu Arg Gly Leu Thr Phe Leu Ala 
2270 227.5 2280 

Ala Gin He Asp Pro Gly Met Lys Gly Tyr Arg Glu Leu Met Gly 
2285 2290 2295 

Glu Ala Leu Arg Gin He Glu Arg Gly Glu Leu Glu Pro Leu Pro 
2300 2305 2310 

Tyr Thr Ala Trp Pro Leu Glu Arg Val Ala Glu Ala Leu Gin Leu 
2315 2320 2325 

\ 

Val Ser Lys Gly Arg His Val Gly Lys Val Val Leu Thr Pro Glu 
2330 2335 2340 

Glu Pro Leu Ser Arg Pro Pro Ala Leu Pro Ser Ser Ser Arg Pro 
2345 2350 2355 

Val Gin Pro Arg Ala Gly Val Ala He Val Gly Leu Arg Ser Ala 
2360 2365 2370 

Glu Gly Cys Glu Ala Phe Glu Arg He Leu Ala Ala Gly Leu Pro 
2375 2380 2385 

Gin Val Ala Val Ser Thr Arg Glu Leu Arg Ala Arg Met Ala Glu 
2390 2395 2400 

He Glu Arg Leu Arg Val Ser Ser Trp Glu Pro Ala Leu Pro Ser 
2405 2410 2415 



Val Pro Arg Gly Ser Pro Ala Arg Thr Glu Arg Ser Lys Pro Tyr 
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2420 2425 2430 

Val Ala Pro Arg Thr Glu Arg Glu Arg Ala Leu Ala Ala Leu Trp 
2435 2440 2445 

Gin Asp Leu Leu Gly Val Ser Glu Val Gly Leu Asp Asp -Asp Phe 
2450 2455 2460 

Phe Glu Leu Lys Gly Asp Ser Leu Leu Ala lie Gin Leu Met Gly 
2465 ** 2470 2475 

Arg Leu Arg Lys Glu Leu Glu Val Glu Leu Pro Leu Ser Ser Phe 
2480 " 2485 2490 

Leu Ala Arg Pro Thr Leu Arg Thr Leu Leu Ser Ala Met Pro Ala 
2495 2500 2505 

Pro Ala Thr Asp Glu Pro Ala Ser Ala Thr Gly Ala lie Pro Ala 
2510 " 2515 2520 

Ala Ala Pro Gin Pro Ser Pro Ala Val Ser Ser Glu Pro Ala Arg 
2525 2530 2535 

Lys Trp Arg His Leu Val Pro He Gin Pro Gin Gly Glu Ser Pro 
2540 2545 2550 

Pro Phe Phe Trp Ala Ala Pro Leu Met Gly Thr Val Phe Pro Tyr 
2555 2560 2565 

Phe Thr Leu Ala Arg Cys Leu Ala Pro Thr His Pro Phe Tyr Ala 
2570 ~ * 2575 2580 

Leu Gin Pro Pro Gly Leu Gin Glu Gly Gin Glu Pro Met Asp Lys 
2585 2590 2595 

Val Glu Glu Leu Ala Ala Leu Tyr Val Arg Glu Leu Arg Glu Leu 
2600 2605 2610 

Bxq Arg Gly Pro Tyr Arg Leu Gly Gly Trp Ser Phe Gly Cys 
2£15 ~ ' 2620 2625 

Val Val Ala Tyr Glu Val Ala Leu Gin Leu Glu Ala Ala Gly Glu 
o«n 2635 2640 



Gin Val Ala Leu Leu Ser Leu Leu Asp Phe Pro Ala Pro Ser Gly 
v£a5 2650 2655 
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Gin Arg Ala Ser Leu Trp Ala Thr Ala Arg Phe Phe Ser Gly Ser 
2660 2665 2670 

Val Leu Arg Gly Leu Ala Pro Tyr Thr Leu Asp Tyr Leu Tyr Leu 
2675 2680 2685 

Ala Ala Arg Ala Pro Leu Asn Pro Asp Ala Ala Ser Arg Leu Ser 
2690 2695 2700 

Ser Ser Leu Thr Arg Gly Trp Glu Arg Leu Arg Lys Gly Gly Leu 
2705 2710 2715 

Val Gin Glu Leu Leu Asp His Ala Ala Met Ala Arg Val Val Pro 
2720 2725 2730 

Glu Asp Ser Arg Leu Leu Leu Leu Arg Glu Pro Gly lie Ala Pro 
2735 2740 2745 

Met Leu Arg Leu Thr Arg Ala His Gin Arg Ala Met Leu Ser Tyr 
2750 2755 2760 

Arg Pro Arg Gly Arg Leu Arg Gin Arg lie Val Leu Phe Arg Thr 
2765 2770 2775 

Ser Glu Gin Ala Trp Ser Phe Ala Arg Asp Leu Gly Trp Gly Ala 
2780 2785 2790 

Leu Ser Ala Ser Gly Val Asp Val Arg Glu Ala Pro Gly Asp His 
2795 2800 2805 

Met Thr Leu Leu Arg Pro Pro His Val Glu Arg Val Ala Glu Gin 
2810 2815 2820 

Leu Arg Ala Leu Leu Ser Ser Ala Pro Leu Pro Thr Arg Arg 
2825 2830 2835 

<210> 14 

<211> 385 

<212> PRT 

<213> Angiococcus discif ormis 



<400> 14 



Met Lys Phe Pro Pro Ala Ser Leu Cys Phe Val Leu Pro Met Leu Ser 
15 10 15 
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Ala Thr Ser Val Gin Ala Gin Ala Pro Glu Thr Trp Thr Glu His Trp 
20 25 30 



Phe Glu His Asn Gin Thr Val Ser Arg Val Tyr Gin Asp Asn Asp Val 
35 40 45 



Ala Val Tyr Phe Asp Ser Ala Val Asn Arg Ser lie Thr Trp Pro Asn 
50 " 55 60 



Thr Tyr Leu Gly Asp Val Trp Arg Tyr Thr Lys Arg Thr Tyr Gly His 
65 70 75 80 



Phe Gly Thr Asp Ser Gin Leu Tyr Thr lie Phe His Ala Gly Lys Tyr 
85 90 95 



Ser Gly Gly His Pro Ser Thr Tyr Phe Asp Ala Ser His Asp Arg Arg 
100 105 110 



Asn Val lie Asp Val Gly Ser Ser Ser Pro Thr Ala Trp Thr Ala Gly 
115 120 125 



Thr Gly Asn Asp Leu Asp lie Val Thr His Glu Val Ala His lie Val 

130 135 140 

Glu Leu Ala Ser Lys Gly Val His Asp Ser Pro Ala Phe Gly Leu Trp 

145 150 155 160 



Arg Asp Ser Lys Trp Ala Glu lie Phe Asn Tyr Asp Val Tyr Leu Ala 
165 170 175 



Leu Gly Arg Thr Ser Asp Ala Asn Arg Trp Tyr Asn Leu Met Val Asn 
180 *" 185 190 



Thr Thr Asp Ser Phe Pro Arg Ala Asn Thr His Trp Phe Arg Asp Trp 
195 200 205 



Phe Tyr Pro lie Tyr Lys Asn His Gly Gly Ser Ser Val Leu Asn Arg 
210 215 220 



Tyr Phe Val Leu Leu Ala Gin Tyr Leu Pro Lys Asn Gly Ser Asn Tyr 
225 230 235 240 



Ala Arg Ser Leu Asn Trp Gly Glu Phe lie His Phe Trp Ser Gly Ala 
245 250 255 



Ale GIv Val Asn Leu Lys Thr Leu Ala Thr Ser Ala Phe Gly Trp Pro 
260 265 270 



WO 2004/022586 PCT/EP2003/009780 

131 

Ala Glu Trp Glu Ala Gin Phe Thr Gin Ala Gin Arg Asp Phe Pro Phe 
275 280 285 

Thr Tyr Thr Pro Pro Gly Pro Thr Ala Val Thr Val Phe Gin Asp Gin 
290 295 300 

Asn Tyr Gly Gly Tyr Gly Met Ala Leu Pro Val Gly Arg Tyr Thr Leu 
305 ^ 310 315 320 

Ser Ala Leu His Ala Trp Gly Val Arg Asn Asp Asp lie Thr Ser Leu 
325 330 335 

Lys Val Ala Ser Gly Tyr Lys Val Thr Leu Tyr Glu His Asp Asn Phe 
340 345 350 

Ser Gly Ala Ser Leu Thr Lys Thr Ala Asp Asp Ala Ser Leu Val Asp 
355 360 365 

Asp Ser Trp Asn Asp Lys Met Ser Ser Leu He Val Ser Ala Ser Gly 
370 ~ 375 380 



Met 
385 



<210> 15 
<211> 252 
<212> PRT 

<213> Angiococcus disciformis 
<400> 15 

Met Phe Asp Ala Thr Arg Tyr Leu Glu Arg He Gly Val Ser Ala Gly 
15 10 15 

Thr Pro Leu Ala Ala Leu His Arg Ala His Leu Glu Ala Val Pro Phe 
20 25 30 

Glu Asn Leu Asp He His Leu Lys Arg Pro He Arg Leu Asp Glu Ala 
35 40 45 



Ala Phe Phe Glu Lys Val Val Leu Gin Arg Arg Gly Gly Phe Cys Tyr 
50 ^ '55 60 



Glu Leu Asn Gly Leu Phe Ala Arg Leu Leu Arg Thr Leu Gly Tyr Arg 
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7 c 80 
65 70 75 

Val Thr Leu Leu Ser Ala Arg Val Ala Ser Arg Pro Asp Gly Ser Ala 
85 90 95 

Tyr Gly Pro Asp Phe Asp His Leu Ala Leu Leu Val Glu Asp Ala Ser 
100 105 110 

Gly Arg Trp Leu Ala Asp Val Gly Phe Gly Glu Cys Phe Leu Glu Pro 
115 120 125 

Leu Arg Leu Asp Glu Arg Gly Val Gin Thr Gin Asp Gly Arg Gly His 
130 135 140 

Arg Leu Val Glu Asp Ala Glu Gly Leu Val Val Trp Arg Glu Ala Ala 
145 150 155 

Ser Gly Asp Trp Lys Ala Gin Tyr Val Val Ser Leu He Pro Arg Glu 
165 I 70 

Leu Gly Asp Phe Ala Ala Met Cys His His Gin Gin Thr Ser Pro Glu 
180 I 85 130 

Ser His Phe Thr Gin Arg Arg Leu Cys Thr Arg Thr Thr Pro Asp Gly 
195 200 205 

Arg He Thr Leu Lys Glu Gly Ala Leu Val Val Thr Ser Gly Gly Ala 
210 215 220 

Arg His Glu Gin Arg Leu Pro Asp Glu Ala Ala Trp Arg Ala Ala Leu 
225 230 235 

Thr Arg His Phe Gly He Thr Leu Pro Ser Gly Gly 
245 250 



<210> 


16 


<211> 


308 


<212> 


PRT 


<213> 


Angiococcus discif ormis 



<400> 16 

Met Thr Thr Ser Lys Val Arg Ser Gly Ala Thr Leu Gly Gly Val Thr 
1 5 10 13 
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Ala Arg Phe Ser Thr Gly Glu Gin Pro Ala Pro Lys Lys Tyr Val Asp 
20 25 30 

Tyr Thr Val Lys Ser Gly Asp Thr Phe Ser Gly He Ala Ala Lys His 
35 40 45 

Glu Leu Ser Leu Ala Ala Leu Lys Lys Leu Asn Pro Gin Val Glu Asn 
50 55 60 

He Asn Val He His Pro Gly Gin Lys Leu His Val Lys Val Val Asn 
65 70 75 80 

Ala Pro Ala Pro Glu Pro Lys Pro Thr Pro Lys Pro Pro Thr Asn Lys 
85 90 . 95 

Asn Thr Ser Val Leu Pro Lys Gly He Pro Asn Thr Glu Gly Met Ser 
100 105 HO 

Gin Ala Lys Glu Tyr Ala Leu Tyr Ser Lys Tyr Val Glu Lys Tyr Gly 
115 120 125 

Asp Ala Lys Ala Lys Gin Asp Leu Ala Ala Gly Lys Arg Val He Val 
130 135 140 

Gly Leu Arg Val Asn Thr Pro Phe Thr Arg Asp Arg Pro Ser Gly Gly 
145 " 150 155 160 

Ser Tyr Asp Asp Arg Leu Val Val Met Trp Lys Asp Ser Ser Gly Lys 
165 170 . 175 

Pro His Val Gin Glu Phe Lys Ala Asn Thr Glu Pro Asn Arg Arg Trp 
180 185 190 

Ala Asp Asp Pro Ser Gin Ser Thr Lys Pro Val Gly Arg Leu Val Gly 
195 200 205 

Asn Lys Thr Tyr His Tyr Arg Lys Ser Phe Asn Gly Asn Phe Gly Gly 
210 " 215 220 

Asn He Leu Ala Pro Asp Leu Arg Tyr Gly Asn Pro Thr Val Arg Arg 
225 230 235 240 

Asp Thr Asn Arg Asp His Arg He Asn Ser Lys Asp Asp Val Phe Ser 
245 250 255 



Gly Asp Trp Gly Gly Gin Gly Tyr Tyr Phe His Arg Gly Gly Thr Thr 
260 265 270 
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Asp Thr Tyr Ser Ala Gly Cys Gin Thr Met Asp Gin Gly Arg Phe Asn 
275 280 285 

Asn Phe Trp Ala Ala Leu Gly Ser Gin Ser Glu Phe Ser Tyr Val Leu 
290 295 300 

Ala Gin Val Gly 
305 

<210> 17 
<211> 293 
<212> PRT 

<213> Angiococcus disciformis 



<400> 17 

Met Thr Arg Val Glu Gin Leu Arg Asp Leu Glu Thr Ala Lys Gin Gly 
15 10 15 

Ala Ala Leu Leu Glu Ala Glu Asn Ala Arg Leu His Gin Arg Leu Glu 
20 25 30 

Ala Leu Val Ala Glu Asn Ala Arg Leu Lys Gly Glu Asp Ala Gin Ala 
35 40 45 

His Leu His His Ala Asp Glu Thr His Trp Leu Leu Leu Asp Lys Gly 
50 55 60 

Pro Gly Thr Lys Trp Tyr Ala Trp Thr Val Ala Ser Pro Asp Thr Val 
65 70 7.5 80 

Phe His Arg He Tyr Pro Ser Arg Ser Gly Ala Thr Ala Arg Thr Val 
85 90 95 

Leu Gly Asp Tyr Glu Gly Val Val Leu Val Asp Gly Tyr Ala Ala Tyr 
100 ' 105 HO 

Glr Thr Ala Thr Lys Ser Ser Ala Asp Gly Pro Cys Pro Ala Thr Leu 
115 120 125 



t 1 



P^e Cys Trp Ala His Val Arg Arg Lys Phe Phe Glu Ala Gin Gin 
l^n ~ 135 140 



iia Pro Ala Cys Lys Glu Val Leu Asp Leu He Gly Glu Leu Tyr 
145 " 150 155 160 
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Ala Val Glu Ala Asp Leu Pro Gly Trp Tyr Ala Leu Glu Gly Glu Glu 
165 170 175 

Arg Gin Ala Ala Leu Ala His Arg Leu Ala Val Arg Gin Gin Lys Ser 
180 185 190 

Ala Pro Leu Thr Glu Arg lie Arg Asp Trp Ala His Ala Gin Arg Ala 
195 200 205 

Leu Pro Gly Ser Ala Phe Arg Lys Ala Leu Glu Tyr Met Leu Asn Leu 
210 215 220 

Tro Ala Gly Leu Thr Val Phe Leu Thr Gin Pro Gin Val Pro Leu Asp 
225 230 . 235 240 

Asn Asn His Val Glu Arg Gin Leu Arg Asp Met Val lie Gly Arg Lys 
245 250 255 

Asn Leu Lys Thr Ala Arg Leu Arg Gly Glu Ala Pro Gly His Tyr Leu 
260 " 265 270 

Arg Arg Pro Ala Leu Ala Ala He Glu Asn Pro Gly Thr Val Thr Leu 
275 ' 280 285 

Pro Lys Ala Gin Ser 
290 

<210> 18 

<211> 82868 

<212> DNA 

<213> Angiococcus disciformis 



<220> 

<221> CDS 

<222> (72) . . (1148) 

<223> ORF24 



.<220> 
<221> CDS 
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<222> (1550) . . (2812) 
<223> ORF23 

<220> 

<221> CDS 

<222> (7660) . . (8214) 

<223> ORF19 

<220> 

<221> CDS 

<222> (10083) (11138) 

<223> ORF18 

<220> 

<221> CDS 

<222> (11229) . . (12284) 

<223> ORF17 

<220> 

<221> CDS 

<222> (47447) . . (48664) 

<223> ORF1 

<220> 

<221> CDS 

<222> (48674) . . (49795)' 

<223> tubZ 



<221> CDS 

<Z22> (49813) . . (50472) 
<223> ORF2 
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<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 



CDS 

(61715) . . (64060) 
ORF7 

CDS 

(68036) . . (69034) 
ORF10 

CDS 

(70676) . . (72319) 
ORF12 

CDS 

(73312) . . (75552) 
ORF13 

CDS 

(78163) . . (79416) 
ORF15 

CDS 

(79561) . . (82866) 
ORF16 
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<400> 18 

attacgaatt cgcggccgca attaaccctc actaaaggga tcaatcgaaa gcacctcgag 60 

attcgaggtg t gtg ggg cgt gga ctt cag cca ctt cca ccg gat get ccg 110 
Val Gly Arg Gly Leu Gin Pro Leu Pro Pro Asp Ala Pro 

15 10 

cat cct cga ccg gca egg tgc gag ggt cgc egg tgg caa ggg ctg gga 158 

His Pro Arg Pro Ala Arg Cys Glu Gly Arg Arg Trp Gin Gly Leu Gly 

15 20 25 

gca cat ggt cag ccg cac eta egg cct cga aga ggt gaa tea ggc cct 206 

Ala His Gly Gin Pro His Leu Arg Pro Arg Arg Gly Glu Ser Gly Pro 

30 35 40 45 

ggc cga tgt cga gac ggg teg cat cgt gaa ggg act cat ccg ccc caa 254 

Gly Arg Cys Arg Asp Gly Ser His Arg Glu Gly Thr His Pro Pro Gin 

50 55 60 



tec cgt teg tac etc gcg atg acg gee gat teg atg gat gee acc gtg 302 

Ser Arg Ser Tyr Leu Ala Met Thr Ala Asp Ser Met Asp Ala Thr Val 
65 70 75 

ccc teg egg agg cca gee ctg ggc ttc ate ggc ggt ggg gtg atg gee 350 

Pro Ser Arg Arg Pro Ala Leu Gly Phe lie Gly Gly Gly Val Met Ala 
80 . 85 90 

gag gee ttg att gga gcg gtc acc ggg gga ggc ctg gee gcg ccc ggc 398 

Glu Ala Leu lie Gly Ala Val Thr Gly Gly Gly Leu Ala Ala Pro Gly 
95 100 105 

acg etc etc gtc gga gag gtc gac gag gag cgt cgc egg gtg etc cgc 44 6 

Thr Leu Leu Val Gly Glu Val Asp Glu Glu Arg Arg Arg Val Leu Arg 

110 115 120 125 

gag cag tat ggc gtc gag gtg acc gcg gac aac etc gcg gcg gcg gac 4 94 

Glu Gin Tyr Gly Val Glu Val Thr Ala Asp Asn Leu Ala Ala Ala Asp 

130 135 140 

gcg gaa ctg gtc gtg ctg gee gtg aag ccg caa cac ctg ggt gtt gtc 542 

Ala Glu Leu Val Val 'Leu Ala Val Lys Pro Gin His Leu Gly Val Val 
145 150 155 

etc cag cag ctg cgc ggg cag etc cgt gag gac acg att gtc gtc age 590 

Leu Gin Gin Leu Arg Gly Gin Leu Arg Glu Asp Thr lie Val Val Ser 
160 ^ 165 170 

ate ate gcg ggc gcg cgc ctg cag acg ctg tgc gag ggt etc gaa cac 638 

He He Ala Gly Ala Arg Leu Gin Thr Leu Cys Glu Gly Leu Glu His 
175 ~ 180 185 

cca etc gtc gtt cgc gcg atg ccc aac acc ccc gcg egg gtc cgc aag 686 

Pro Leu Val Val Arg Ala Met Pro Asn Thr Pro Ala Arg Val Arg Lys 

190 195 200 205 

gcg gec acc ttc tgg att cca age gcg ggc ttg age gag ccg gcg ctg 734 

Ala Ala Thr Phe Trp He Pro Ser Ala Gly Leu Ser Glu Pro Ala Leu 

210 215 220 



gtg cgc gtc cgc tea ctg ctg ggc gcg etc ggc acc gaa gtg gag gtc 
Val Arg Val Arg Ser Leu Leu Gly Ala Leu Gly Thr Glu Val Glu Val 
225 230 235 



782 
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gcc gac gag gcc ggc gtg gag atg gcg acg ggc etc gec ggc ccc atg 
Ala Asp Glu Ala Gly Val Glu Met Ala Thr Gly Leu Ala Gly Pro Met 
240 245 250 

ccc gcg ttc gtc tat ttc etc ate gag gcg ttc ate gac gcg ggc gtg 878 
Pro Ala Phe Val Tyr Phe Leu lie Glu Ala Phe lie Asp Ala Gly Val 
255 260 265 

gcc ctg ggg ctg ccg cgt gat cag gcc acc ctg gcg acg gtg gag age 926 
Ala Leu Gly Leu Pro Arg Asp Gin Ala Thr Leu Ala Thr Val Glu Ser 
270 275 280 285 

atg cgc gga age ctg gag ttg ctg cat cgc age cag gag tec ccg gcg 974 
Met Arg Gly Ser Leu Glu Leu Leu His Arg Ser Gin Glu Ser Pro Ala 
290 295 300 

gcg ttg cgc aag cag gtc acg age cct ggt ggc gcg acg ctg gcc ggc 1022 
Ala Leu Arg Lys Gin Val Thr Ser Pro Gly Gly Ala Thr Leu Ala Gly 
305 310 315 

ttg aag gtg ttc gag age gcg ggc gtc egg gat gcg atg gac gcg gcg 1070 
Leu Lys Val Phe Glu Ser Ala Gly Val Arg Asp Ala Met Asp Ala Ala 
320 325 330 

gtg cgc gcg gtg cac gcg cgc gcg cag gag etc ggg egg ctt ccg etc 1118 
Val Arg Ala Val His Ala Arg Ala Gin Glu Leu Gly Arg Leu Pro Leu 
335 340 345 

gcc ccg etc gag ccc ggg aaa gag ggc tga ctcctggtaa ctcctcgccg 1168 
Ala Pro Leu Glu Pro Gly Lys Glu Gly 
350 355 



taccccctca 


ttgccgaggt 


gacggcccgc 


tcacgccagc 


agegecgaga 


gegggaaega 


1228 


cgcaagcgtg 


aggcttcget 


egggagegge 


ggctaaaccg 


gaagcggatg 


agagtccaag 


1288 


agaacccctc 


atacgacaac 


tategtagtt 


ctcaccgccc 


egaagecttt 


aegaeggagg 


1348 


aaggacggtg 


ttcccctacg 


egggtcegtg 


ggcaatccgt 


gcccgggacc 


tctggagttc 


1408 


cccatgcgag 


gttcgttgca 


gtccttttcc 


ctggcgatgg 


tgttcctcgt 


gggtctgetc 


1468 


gcgggccgtc 


cctcgtcagc 


ccatgaatcg 


ctccgtgagc 


geatcgatge 


cttcgtgcgc 


1528 


geggagcaga 


aacggatgaa 


t gtg ccg ggc gtc gcg 
Val Pro Gly Val Ala 
360 


gtg gcc gtc gtg age 
Val Ala Val Val Ser 
365 


1579 



cac ggc agg gtc ate gtg gcg aag ggg tat ggt ttc teg aac ctg gag 1627 

His Gly Arg Val lie Val Ala Lys Gly Tyr Gly Phe Ser Asn Leu Glu 
370 375 380 

cat cag gtt ccc gtc acc ccg gac acc etc ttc cag tec ggc tec ctg 1675 

His Gin Val Pro Val Thr Pro Asp Thr Leu Phe Gin Ser Gly Ser Leu 

385 390 395 400 

ggc aag cag ttc acc gcc atg gcg gtg atg etc cag gtg gag gcg ggc 1723 

Gly Lys Gin Phe Thr Ala Met Ala Val Met Leu Gin Val Glu Ala Gly 
405 410 . 415 

cgc ate teg ctg tec gac age ate acg aag tac ttc ccc gac gcg ccc 1771 

Arg He Ser Leu Ser Asp Ser He Thr Lys Tyr Phe Pro Asp Ala Pro 
420 425 430 
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teg acc tgg gec ccc ate acg att cgc cac ctg ctg acg cac acc tec 1819 
Ser Thr Trp Ala Pro He Thr He Arg His Leu Leu Thr His Thr Ser 
435 440 445 

ggc etc teg gat ggg ctg etc gac ctg cgc aag gac tac acc gac gag 1867 
Gly Leu Ser Asp Gly Leu Leu Asp Leu Arg Lys Asp Tyr Thr Asp Glu 
450 455 460 

gag ttc gcg cag ttc etc tat acg ctg ccc ctg gac ttc ccc gec ggc 1915 
Glu Phe Ala Gin Phe Leu Tyr Thr Leu Pro Leu Asp Phe Pro Ala Gly 
465 470 475 480 

ctg cga tgg aac tac age aac gec ggc tac gtc ctg ctg ggc etc ctg 1963 
Leu Arg Trp Asn Tyr Ser Asn Ala Gly Tyr Val Leu Leu Gly Leu Leu 
485 490 495 

gtg aac cgc gtc gcg ggc acc tec tac gtc aac gtc ctg ggc gag cag 2011 
Val Asn Arg Val Ala Gly Thr Ser Tyr Val Asn Val Leu Gly Glu Gin 
500 505 510 

gtc ttc aag ccc acc cgc atg aag acg gcg cga ggc ate agt gag get 2059 
Val Phe Lys Pro Thr Arg Met Lys Thr Ala Arg Gly lie Ser Glu Ala 
515 520 525 

gac att gtt ccc aat cgc gcg gee ggt tac cgc ctg etc gac ggg gtg \ 2107 
Asp He Val Pro Asn Arg Ala Ala Gly Tyr Arg Leu Leu Asp Gly Val 
530 ~ 535 ^ 540 

gtg aag aac cag gag tgg gtc tec ccc tec etc aac acc acg gcg gat 2155 
Val Lys Asn Gin Glu Trp Val Ser Pro Ser Leu Asn Thr Thr Ala Asp 
545 550 555 560 

ggc teg etc tac ttc tec ctg aaa gac atg ctg gcg tgg gat gac gee 2203 
Gly Ser Leu Tyr Phe Ser Leu Lys Asp Met Leu Ala Trp Asp Asp Ala 
565 570 575 

gtg gat gac cgc gee ate ctg acg aac ggg age tgg egg gac ate ctg 2251 
Val Asp Asp Arg Ala He Leu Thr Asn Gly Ser Trp Arg Asp He Leu 
580 585 590 

agt ccc gtg aag etc tec age gga get tec tat ccg tat ggc ttc gga 2299 
Ser Pro Val Lys Leu Ser Ser Gly Ala Ser Tyr Pro Tyr Gly Phe Gly 
595 600 605 

tgg ttc ate gag gag cgc aac ggc cag etc gtg cac cag cac gee ggt 2347 
Trp Phe He Glu Glu Arg Asn Gly Gin Leu Val His Gin His Ala Gly 
610 615 620 

gca tgg cag ggc ttc aag acc gec tac tac cgc ttc acc gga gac gee 2395 
Ala Trp Gin Gly Phe Lys Thr Ala Tyr Tyr Arg Phe Thr Gly Asp Ala 

630 635 640 

ctg tec ate ate gtg ctg etc aac etc gcg gag ccg aac ccc gcg gee 2443 
Leu Ser He He Val Leu Leu Asn Leu Ala Glu Pro Asn Pro Ala Ala 
645 650 655 

itc gec aat ggc ate gec gec ate gtc aac ccc gcg etc gec gtg cct 24 91 

r;o Asn Gly He Ala Ala He Val Asn Pro Ala Leu Ala Val Pro 

660 665 670 

r-i-rj ccc att ccg gac etc gaa ccg gag gtc acg gec egg etc 2539 

Pro Leu Ala Pro He Pro Asp Leu Glu Pro Glu Val Thr Ala Arg Leu 
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675 680 685 

gcg acg ctg etc gag cag gca egg gec gga gcg ctg aac cct gec gag 2587 
Ala Thr Leu Leu Glu Gin Ala Arg Ala Gly Ala Leu Asn Pro Ala Glu 
690 695 700 

ttc gcg tac gtc ccc ggg tgg ttc ttc acg gag gec gcg ccg tat tac 2635 
Phe Ala Tyr Val Pro Gly. Trp Phe Phe Thr Glu Ala Ala Pro Tyr Tyr 
705 710* 715 720 

cag teg ctg eta cag age ctg gga ccg teg ggg ccg etc gtc ctg gcg 2683 
Gin Ser Leu Leu Gin Ser Leu Gly Pro Ser Gly Pro Leu Val Leu Ala 
725 730 735 

aag egg gag atg egg ggc gat gac cgc gtc tac acg tac ctg gtg cag 2731 
Lys Arg Glu Met Arg Gly Asp Asp Arg Val Tyr Thr Tyr Leu Val Gin 
740 745 750 

gec ggc tec acg atg ctg cgc tac egg gtg gcg etc ate ccg gat ggc 2779 
Ala Gly Ser Thr Met Leu Arg Tyr Arg Val Ala Leu lie Pro Asp Gly 
755 760 765 

egg gtg tec gee ttc gcg ctg age ccg aac taa gagctgaccc gcccgggccg 2832 
Arg Val Ser Ala Phe Ala Leu Ser Pro Asn 



770 




775 








cacageggga 


gcggtggtca 


gctctgggcc ttcggcagcg 


tcacggtgcc 


eggattcteg 


2892 


atggcggcga 


gcgcagggcg 


cegcaggtag tggcccgggg 


octet cct eg 


cagccgcgct 


2952 


gtcttgaggt 


tcttcctgcc 


tatcaccatg tcacgcagct 


ggcgctcgac 


gtggttattg 


3012 


tccagcggca 


cctgcggctg 


ggtgaggaag accgtcagcc 


ccgcccagag 


gttgagcatg 


3072 


tactcgaggg 


ecttgegaaa 


ggcactgccg ggcagtgcgc 


getgegegtg 


cgcccagtcg 


3132 


eggatgeget 


eggtcagegg 


cgcggacttc tgetgacgea 


cggcgagccg 


gtgegegage 


3192 


gcggcctgac 


gctcctcgcc 


ctccagcgcg taccagcccg 


gcaggtccgc 


ctcgacggcg 


3252 


tacagctcgc 


cgatgaggtc 


cagcacctct ttgcaggcag 


gggcaaactg 


ctgggcttcg 


3312 


aagaattttc 


tcctcacgtg 


ggcccagcag aaggccaggg 


tggctggaca 


gggcccgtcc 


3372 


gcgctggact 


tcgtcgccgt 


ctggtacgcc gcatacccat 


ccaccagcac 


gacgccctcg 


3432 


tagtcgccca 


gcacggtgcg 


cgccgtcgcc ccggagcggc 


tggggtagat 


gcgatggaag 


3492 


acggtgtcgg 


gcgaggccac 


cgtccacgcg taccacttgg tgcccggccc 


cttgtccagc 


3552 


ageagecaat 


gcgtctcatc 


cgcgtggtgc aggtgegect gggcgtcctc gcccttgagc 


3612 


egggcattet 


ccgcgacgag 


cgcctcaagg cgctggtgca 


geegggegtt 


ttccgcctcc 


3672 


agcagcgccg 


ccccctgctt 


ggccgtctcc aagtctcgea 


gctgctcgac 


gcgcgtcatg 


3732 


actacctgac 


aggtgeggea 


ggacgctcgc tgacgtgcac 


cggcaccaga 


gccgcgccct 


3792 


teteggeggg 


etteggaegg 


cggtgtcccg cggggcgccc 


cttgcgccac 


tegtagageg 


3852 


tcggcccgga 


cacgcccagt 


tgctgcgccg cgtccgtcac 


cgtcccaccc 


ttcgccatgg 


3912 


tgtgctcggc 


gtagcgcaec 


gegaaggege gcagcgcctc 


ggggaatggc 


aacgagccac 


3972 
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cgcgccgccc 


ggccttcaat 


cgctgcgctt 


cctgacggaa 


ctgctccagc tccttctcca 


tggaccacct 


cctqqcqtgt 


ggccaaggga 


agtctcacga 


aggctggagc ccgttacgac 


crqcrcrcaccrac 


gaggtcttac 


gactcctgct 


cgcgcggcgg 


gtgtccgggt cttccggatg 


aacftaaatcc 

3 3 '—33 3 www 


cctacgccac 


gtcagccgac 


ctgggccagg 


acgtagctga actcgctctg 


aqaaccaacrc 


gcggcccaga 


agttgttgaa 


gcggccctgg 


tccatcgtct ggcagccggc 


qgacrtacatq 


tcggtcgttc 


cgccgcgatg 


gaagtagtag 


ccctggccgc cccagtcgcc 


gctgaacaca 


tcgtccttcg 


agttgatccg 


gtgatcgcgg 


ttggtgtcgc . gccgcacggt 


cgqqttcccg 

"33 3 


tagcgcaggt 


ccggcgcgag 


gatgttgcca 


ccgaagttgc cgttgaagct 


cttccggtag 


tggtaggtct 


tgttgccgac 


gaggcggccg 


accggcttgg tgctctggct 


cgggtcatcg 


gcccagcggc 


ggttgggctc 


ggtgttggcc 


ttgaactcct ggacgtgcgg 


cttgccgctg 


ctgtccttcc 


acatgacgac 


gagccgatcg 


tcgtaggagc cgccggaggg 


gcgatcacgg 


gtgaagggcg 

3 3 3 3 3 3 


tgttcaccct 


caggccgacg 


atgacgcgct tgccggccgc 


qacfotcctcrc 


ttcgccttgg 


cgtcgccgta 


cttctccacg 


tacttcgagt agagcgcgta 


ctccttcgcc 


tgagacatgc 


cctcggtgtt 


cgggatgccc 


ttggggagca cgctggtgtt 


tttgttggtc 


ggcggcttcg 


gtgtcggctt 


cggctcaggc 


gcgggcgcat tcacgacctt 


cacgtgcagc 


ttctggcccg 


ggtggatgac 


gttgatgttc 


tcgacctgcg ggttcagctt 


cttcaatgcg 


gccagcgaga 

3 3 3 3 


gctcgtgctt 


cgccgcgatg 


cccgagaacg tgtcaccgct 


cttcacggtg 


taatccacgt 


acttcttggg 


ggccggctgc 


tccccggtgc tgaacctggc 


agtgactcca 


cccagcgtgg 


cgccggaccg 


gaccttggac 


gtcgtcattc* ggggcactcc 


tgaaaccgaa 


ggtgtctggc 


tggattgtcg 


aaggtgcggg 


ctcgaagttt cttagaattt 


gatccatcac 


cggccccagt 


gccagcaggg 


caaacggcga 


gtcctttgga tttctacgct 


cagtgaagtg 

3 3 3 3 


ctggcggcct 

^33 33 


gcggacccgt 


cccggggcct 


tcacggcggc aggtagagtg 


ggccggctcc 


tcatgtgcgt 


ccggagcggc 


gtcatccgcc 


ggagggcagg gtgatgccga 


aqtgccgcgt 

»»* 3 3 3 


gagcgcggcc 

3 3 3 3 3 


cgccacgcgg 


cctcgtccgg 


gagccgctgc tcgtggcgcg 


cgccccccga 


ggtgacgacg 


agcgcgccct 


ccttgagggt 


gatgcgcccg tccggcgtgg 


tccgggtgca 


gaggcgtcgc 


tgcgtgaagt 


gggactccgg 


cgaggtctgc tggtggtggc 


acatggccgc 


gaagtccccg 


agctcgcgcg 


ggatgagcga 


gacgacgtac tgggccttcc 


agtcccccga 


cgccgcctcg 


cgccagacga 


cgagcccctc 




ggccgcgtcc 


gtcctgggtc 


tgcacgccgc 


gctcgtcgag 


gcgcagcggc tccaggaagc 




gcccacgtcg 


gccagccagc 


ggcccgaggc 


gtcctccacc aggagcgcca 




gtcgggaccg 


taggcgctgc 


catcgggacg 


cgaggccacg cgcgccgaca 



4032 
4092 
4152 
4212 
4272 
4332 
4392 
4452 
4512 
4572 
4 632 
4692 
4752 
4812 
4872 
4932 
4992 
5052 
5112 
5172 
5232 
5292 
5352 
5412 
5472 
5532 
5592 
5652 
5712 
5772 
5832 
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gcagcgtcac 


ccggtaaccg 


agagtccgga 


143 

geaggeggge 


gaagaggeca ttgagctcgt 


5892 


agcagaaccc 


accgcgccgc 


tggaggacga 


ccttctcgaa 


gaaggccgcc tcatcgagcc 


5952 


ggatgggccg 


cttcaggtgg 


atgtcgaggt 


tetcgaaegg 


cacggcctcg aggtgggccc 


6012 


gatggagcgc 


ggcgagcggc gtacccgccg 


agacaccgat 


acgctccagg tagcgagtgg 


6072 


cgtcgaacat 


gcgcactgtg taccgcggtg 


ggcccgttgg 


gggcgaccca ttctccaatc 


6132 


catgggcggc 


ccgccattcc cctggcgcga 


acgcgccggg 


gagggctege gaaggaagee 


6192 


cggcacgccc 


cggagcccga 


gcggcgctga 


cctacatccc 


cgacgcgctc aegatgageg 


6252 


aggacatttt 


gtcgttccag 


ctgtcgtcga 


egagegagge 


atcgtccgcc gtcttggtga 


6312 


ggctggcacc 


gctgaagttg< 


• tcatgttcat 


acagggtgac 


cttgtagccg ctggcgacct 


6372 


tcagggaggt 


gatgtcgtca 


ttgcgcacgc 


cccaggcatg 


cagggeggae agggtgtaac 


6432 


ggccaacggg 


cagtgccatt 


ccatagccac 


cgtagttctg 


gtcctggaac acggtgacag 


6492 


ccgtggggcc 


cgggggggtg 


taggtgaagg 


ggaagtcacg 


ctgagcctgg gtgaactgcg 


6552 


cttcccactc 


ggcgggccat 


ccgaatgcgg 


acgtggccag 


ggtcttgaga ttgacgcccg 


6612 


cggcaccact 


ccagaagtgg 


atgaactctc 


cccagttcag 


gcttcgcgcg tagttgctgc 


6672 


cgttcttcgg 


caggtactgc 


gcgagcagga 


cgaagtaccg 


gttgagcacc gaggagcege 


6732 


cgtggttctt 


gtagatggga 


tagaaccaat 


cccggaacca 


gtgggtgttg gcgcggggga 


6792 


agctgtccgt 


cgtgttgacc 


atcaggttgt 


accagegatt 


ggcatcactg gtgcgaccca 


6852 


acgccaggta 


gacgtcgtag 


ttgaagatct 


ccgcccactt 


gctgtcgcgc cacaggccga 


6912 


aggcaggcga 


gtcgtgcacg 


cccttgctgg 


cgagttccac 


gatgtgggcc acctcatggg 


6972 


tgacgatgtc 


caggtcgttg 


ccggtgccgg 


cagtccaggc 


cgtgggggaa ctggaaccca 


7032 


catcaatcac 


gttgcgcctg 


tcatggctgg 


cgtcgaagta 


cgtggagggg tggcctccac 


7092 


tgtactttcc 


tgcgtggaag 


atggtgtaga 


getgegagtc 


cgtgccgaag tgtcegtagg 


7152 


tcctcttggt 


gtatcgccac 


acgtcgccca 


ggtaggtgtt 


gggccaggtg atggaccggt 


7212 


tcacggccga 


gtcgaagtac 


acggccacgt 


cattgtcctg 


gtagacgegg gagaeggtet 


7272 


ggttgtgctc 


gaaccagtgc 


tccgtccagg 


tctcgggcgc 


ttgcgcctgg. acggaagtgg 


7332 


cggagagcat 


cggcaacacg 


aagcacaacg 


aagcaggggg 


gaacttcatg ggggagactc 


7392 


ctgtgcgcct 


ggggatgcgt 


gggcttccag 


tgacctcccc 


atagagaatc gagccccttt 


7452 


ttcgggttaa 


cgcgaagttt 


atcggaatga 


gaeggacage 


gagttgaccc ggttcccgtg 


7512 


ggaacagctc 


gccggcagtt 


gagagtgagg 


gaacccgccg 


teagegegtg gtgeggaage 


7572 


ccgcgtaggg 


ggacaccgtt 


tttttgcgcg 


gcgcttcggg 


ccagggcatt cctggggcct 


7632 


ccctctctct 


cgaaggagca 


gtacccg gtg tec ate cgt cga acg acc gec age 
Val Ser lie Arg Arg Thr Thr Ala Ser 
780 785 


7686 



WO 2004/022586 



144 



PCT/EP2003/009780 



acc cgc aat gtc acc gtc agt gcg atg etc gac 
Thr Arg Asn Val Thr Val Ser Ala Met Leu Asp 
790 795 

gcg cgc gac gcg etc cag gtc ttc etc gag aaa 
Ala Arg Asp Ala Leu Gin Val Phe Leu Glu Lys 
805 810 

ttc ggc gec gac tgg cag ggg acg ctg gag cgc 
Phe Gly Ala Asp Trp Gin Gly Thr Leu Glu Arg 
820 " 825 830 

ggc cgt ccc acg age gag cgc tgg gac tgg ggc 
Gly Arg Pro Thr Ser Glu Arg Trp Asp Trp Gly 
840 845 

etc ggg cac ttc cgc teg ccg cag gec acg ggc 
Leu Gly His Phe Arg Ser Pro Gin Ala Thr Gly 
855 860 

gag ccc ate gtc gcg etc gee egg etc gec aag 
Glu Pro lie Val Ala Leu Ala Arg Leu Ala Lys 
870 875 

ttc cag cac gag gaa etc acg gac gcg cag gtg 
Phe Gin His Glu Glu Leu Thr Asp Ala Gin Val 
885 890 

ggg etc gcg ctg ctg cag gag gcg ctg ggc aac 
Gly Leu Ala Leu Leu Gin Glu Ala Leu Gly Asn 
900 905 910 

gag acg cgc gee gcg etc gag gtg tgg gcg ggc 
Glu Thr Arg Ala Ala Leu Glu Val Trp Ala Gly 
920 925 

gcg aag cgt ccg gcg cgc aag gee ccg gcg gec 
Ala Lys Arg Pro Ala Arg Lys Ala Pro Ala Ala 
935 940 

ccc gcg gcg aag aag aag gcg ccg aag gcg aag 
Pro Ala Ala Lys Lys Lys Ala Pro Lys Ala Lys 
950 955 



ccg gec ctg ggc age 7734 
Pro Ala Leu Gly Ser 
800 

gec ctg cgc gee cag 7782 

Ala Leu Arg Ala Gin 

815 

gac egg ctg gcg cag 7 830 

Asp Arg Leu Ala Gin 
835 

age ate age tac egg 7878 
Ser lie Ser Tyr Arg 
850 

age ggc gtg gtg acc 7926 
Ser Gly Val Val Thr 
865 

gcg tat cgc aac etc 7 974 

Ala Tyr Arg Asn Leu 
880 

cgc cac gee atg gag 8022 

Arg His Ala Met Glu 

895 

aag acg cgc gcg gee 8070 
Lys Thr Arg Ala Ala 
915 

ccg ccg gee cgc gee 8118 
Pro Pro Ala Arg Ala 
930 

gcg aag cgt gcg aag 8166 
Ala Lys Arg Ala Lys 
945 

age gec egg cga tag 8214 
Ser Ala Arg Arg 
960 



ggcagggtgg 


gccgcggggc 


ctcagccgtg 


cgccgaaccc gcgcggctcg 


ggccctggag 


8274 


gggccatcag 


tcgttgtacc 


gggttgaggg 


caacgtacgc tcccttgagg 


tggtgggccc 


8334 


gtgggtgagc 


tacgagcaga 


eggatctgag 


aaggeggteg tegeggcatt 


caaggeggae 


8394 


agettctegt 


ccgcaggcac 


atccgtgacg 


acgctgcgcg aatccacctc 


gaacaaccac 


8454 


cgggcctggg 


tacgagggtg 


gggagggcag 


agtcaggagc cattggctgc 


accggggccg 


8514 


gtgacgtccg 


tgaacttege 


gccggtgacg 


cccgcgcgct ccagggcttc 


cttgatctcc 


8574 


tcggccacga 


tcaaggccac 


ggtccagccc 


caggttcgga acaccttggc 


gccgtccacc 


8634 


ttcgtcgtgt 


egatgegcat 


cccggacacc 


gaggagtaca cgggcgtgga 


egaeggggat 


8694 


getgaaggee 


gcgtgggaat 


agtcgagegg 


tttgccggtg atcttgatgg 


ggatcctgat 


8754 
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gcgctcgtcc gaaagtgatt tggacgacgc atgcggctcc actcagtgac aagcagaagc 

agccggttcc cggctacggc ggcctggccg tcacgggccg gtgcggcaac ctcgaacaat 

ccatgagcgt cgaggtgccg cgtatctttc cgggaggcgt cttccctgtg tggaaggggc 

tgctgttcga tcccgcctcg tgggatggct cggacttctt catgccggcg caacgcttcg 8994 

ggttcgtgtt ctgggtggaa gacgatttga gaggatccca ttcgcattga cgtggcacac 9054 

gcgaaatcga tttccctgaa atacttcatt accaggctgt atcacaatca caccagtaaa 9114 

gtaaggctgg cacctccggg gcggcccggt cgcgtggaac ttcacgtctc gaagagaagc 9174 

cagcacacga tgattcgtcc gatcctggcc cggatctgtc acacgaaggg acgcgccggt 9234 

caggagggag ttgggatttc gcgcaactgt ttggcaatgc gcgcgaaaag gtctttcacc 9294 

ggcgccgcca cgcttggagt ggcccagcaa ccatgggatt ggtggtccgt catgccttgg 9354 

cagggggatt gcccgtggga gcatctactt cgctagatgg ttctccacct gacggttggc 9414 

tgctcccaag tctgtgagcg caggtggggc gcacgcacgc aacgcccatt tagatagctg 9474 

cctggcagcc cgttgctttt ctggctactg cacttccccg atcccctccc. atcgttagga 9534 

cgcgcaatgc cttggaggag cggcttggat tttcacggcc ttctacgcac ggagcccaag 9594 

gtcaatcgca cactccaggg caaacgatga atatgaacat gcgagcgcaa gcgggtggaa 9654 

gacacctgag gtgccatgta cagtgagtga ggaatcccgg ctgtgcatga cactttcgtt 9714 

ggggagggcc gaaacgtgga ctacagggac gcggtacaaa tacaaggaca gacgcccagt 9774 

acggcggcgg cagtgacgca gaagatgccc gtggcagggc tgggcaggta cctgagggcc 9834 

atggagacgc tccgactcgg ggccactcgg gcaagcaggg agttgctgcg cgccatgctg 9894 

gaggtggtgg agcaggattt ggaagacgcg gagtcccatg cacagagcgc ggtgccggtg 9954 

gacaagcatg gagtccccca tccctccggc aagccagtac ctgcctcgcc ctcgaagcgc 10014 

ttccgtgggg tacctcccct tcgtgagggc agtgtgcccc ccgagagccc tctgcctgga 10074 

gtgacgaa atg act teg ggc act egg aaa aca gga ggc gac ggg get cgc 10124 
Met Thr Ser Gly Thr Arg Lys Thr Gly Gly Asp Gly Ala Arc 
965 970 975 



egg aat gac aag tac cgc ate etc age ctg gac gga gga gga ctg egg 
Arg Asn Asp Lys Tyr Arg He Leu Ser Leu Asp Gly Gly Gly Leu Are 
980 985 ' 990 



10172 



acg ctg ctg tec att ggc ctg ttg aag cgc ate gac cag ttg egg ccc 10220 
Thr Leu Leu Ser He Gly Leu Leu Lys Arg He Asp Gin Leu Arg Pro 
995 1000 1005 

ggc ttc ctt gat cag gtc gac ctg gtg gcg ggc ace teg gcg gga 10265 
Gly Phe Leu Asp Gin Val Asp Leu Val Ala Gly Thr Ser Ala Glv 
1010 1015 1020 

gee ate age gcg etc ate ate gcg gcg get cgc gag cct gec gtg 10310 
Ala He Ser Ala Leu He He Ala Ala Ala Arg Glu Pro Ala Val 
1025 1030 1035 
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ggg etc gag cag gcg egg cag ate tgg ttc acg cct ggg ctg ttc 
Gly Leu Glu Gin Ala Arg Gin lie Trp Phe Thr Pro Gly Leu Phe 
1040 1045 1050 

gat age cca etc tec aat cag ctg ggg gec ctg gtt ggc cag age 
Asp Ser Pro Leu Ser Asn Gin Leu Gly Ala Leu Val Gly Gin Ser 
1055 1060 1065 

gcg ctg atg ccc age gag aac atg get cgc gcg etc acc cac ate 
Ala Leu Met Pro Ser Glu Asn Met Ala Arg Ala Leu Thr His lie 
1070 1075 1080 

ctg gga gac aag acg etc egg gac ctg aag cgc aag gtg gtg ate 
Leu Gly Asp Lys Thr Leu Arg Asp Leu Lys Arg Lys Val Val lie 
1085 1090 1095 



10355 



10400 



10445 



10490 



ccc gee ttc cag ctg gac gat 
Pro Ala Phe Gin Leu Asp Asp 
1100 1105 



ggc gac cct gat gag 
Gly Asp Pro Asp Glu 
1110 



gac agg cgc 
Asp Arg Arg 



10535 



ggg tgg agg ccg cga ate ttc cac aac ttc ccg ggt 
Gly Trp Arg Pro Arg lie Phe His Asn Phe Pro Gly 
1115 1120 1125 



gac act ttc 
Asp Thr Phe 



gtc aac ctg gat gac tac ctg 

Val Asn Leu Asp Asp Tyr Leu 
1130 " 1135 

tec ctg ccg ate gtc teg ccg 

Ser Leu Pro lie Val Ser Pro 
1145 1150 



gtg gac ctg gec ctg cgc tec age 
Val Asp Leu Ala Leu Arg Ser Ser 
1140 



gtc cat cag ggc tat 
Val His Gin Gly Tyr 
1155 



gtg gat ggg 
Val Asp Gly 



ggc ttg ttc gec aac aac ccc acc atg tea gec gtg gee cag gee 

Gly Leu Phe Ala Asn Asn Pro Thr Met Ser Ala Val Ala Gin Ala 
1160 1165 1170 

ate tac gee aag gcg gcg gat gtg egg gac ate etc gtg ttc tec 

lie Tyr Ala Lys Ala Ala Asp Val Arg Asp lie Leu Val Phe Ser 
1175 1180 1185 

ctg ggc aca ggc gac teg gtg gac tac ctg gac ggt tat aac gag 

Leu Gly Thr Gly Asp Ser Val Asp Tyr Leu Asp Gly Tyr Asn Glu 
H90 1195 1200 



aac tgg ggc tgg aga aag tgg ctg ctg gac ccg aag 
Asn Trp Gly Trp Arg Lys Trp Leu Leu Asp Pro Lys 
1205 1210 1215 



cag ccc atg 
Gin Pro Met 



10580 



10625 



10670 



10715 



10760 



10805 



10850 



gee ttc gtg gec gee acc ate gag gcg gga gtg gag gec ate gac 
Ala Phe Val Ala Ala Thr lie Glu Ala Gly Val Glu Ala He Asp 
1220 1225 1230 



ttc cag gca aag atg ctg ctt ccg egg ggc aac tac 
Phe Gin Ala Lys Met Leu Leu Pro Arg Gly Asn Tyr 
1235 1240 ~ 1245 



tgg agg gag 
Trp Arg Glu 



10895 



10940 



-srt zz~ gtg gtg ccc ttc aac 
Pro Val Val Pro Phe Asn 
1 :: ~>0 1255 



?ec acc ttg gat egg gtc 
He Thr Leu Asp Arg Val 



ctg ggc aac agt gtg gag gee cag 
Leu Gly Asn Ser Val Glu Ala Gin 
1260 



gtc aag agg cac gat 
Val Lys Arg His Asp 



ctg age aag 
Leu Ser Lys 



10985 



11030 
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1270 



1275 



gtg ctg gag tgg gtg gag aac teg ggc tgg caa ccc 
Val Leu Glu Trp Val Glu Asn Ser Gly Trp Gin Pro 
1280 1285 1290 

cag gag aca gec tct ggc gcg cag cgt gag ttc agt 
Gin Glu Thr Ala Ser Gly Ala Gin Arg Glu Phe Ser 
1295 1300 ~ 1305 



aag gec acg 
Lys Ala Thr 



ggc gag teg 
Gly Glu Ser 



gga gag gcg egg age tga gcaggggttc gaacggcctg acggccacct 
Gly Glu Ala Arg Ser 
1310 



11075 



11120 



11168 



gcttgacatg gggtggccgc agetgeggea tctcgccttc ggaccgccgg aggtggagaa 11228 



atg ace ate ate cat ggg cag cca gga gee gac ggg 
Met Thr lie lie His Gly Gin Pro Gly Ala Asp Gly 
1315 1320 - 1325 



aat ggc aag tac cgc ate etc 
Asn Gly Lys Tyr Arg lie Leu 
1330 1335 



age ctg gac gga gga 
Ser Leu Asp Gly Gly 
1340 



acg ctg ctg tec att ggc ctg ttg aag cgc ate gac 
Thr Leu Leu Ser lie Gly Leu Leu Lys Arg lie Asp 
1345 1350 " " 1355 



ccc ggc ttc ctt gat cag gtc 
Pro Gly Phe Leu Asp Gin Val 
1360 1365 



gac ctg gtg gcg ggc 
Asp Leu Val Ala Gly 
1370 



gga gec ate age gcg etc ate ate gcg gcg get cgc 

Gly Ala lie Ser Ala Leu lie lie Ala Ala Ala Arg 
1375 1380 1385 

gtg ggg etc gag cag gcg egg cag etc tgg etc acg 

Val Gly Leu Glu Gin Ala Arg Gin Leu Trp Leu Thr 
1390 1395 1400 

ttc cac tea teg etc agg ace aga ttg ggg gee ate 

Phe His Ser Ser Leu Arg Thr Arg Leu Gly Ala lie 
1405 1410 " 1415 



get cgc egg 
Ala Arg Arg 



gga ctg egg 
Gly Leu Arg 



cag* ttg egg 
Gin Leu Arg 



ace teg gcg 
Thr Ser Ala 



gag cct gee 
Glu Pro Ala 



cat gag ctg 
His Glu Leu 



att ggc atg 
lie Gly Met 



aac gcg ctg acg ccc age aag aac atg get cgt gcg etc ace cac 
Asn Ala Leu Thr Pro Ser Lys Asn Met Ala Arg Ala Leu Thr His 
1420 1425. 1430 



11273 



11318 



11363 



11408 



11453 



11498 



11543 



11588 



ate ctg gga gac aag acg etc 
lie Leu Gly Asp Lys Thr Leu 
1435 "* 1440 

ate ccc gec ttc aag ttg gac 
lie Pro Ala Phe Lys Leu Asp 
1450 1455 



ggg gac ctg aag cgc 
Gly Asp Leu Lys Arg 
1445 



aag gtg gtg 
Lys Val Val 



gat gaa aac cct gac gag aac aga 
Asp Glu Asn Pro Asp Glu Asn Arg 
14 60 



egg gag tgg aag ccg cga gtc ttt cac aac ttt ccg 
Arg Glu Trp Lys Pro Arg Val Phe His Asn Phe Pro 
1465 1470 1475 



ggt gee gac 
Gly Ala Asp 



ttc gtc aac cca gga gac cgc ctg gta gac ctg gee ctg cgc tec 
Phe Val Asn Pro Gly Asp Arg Leu Val Asp Leu Ala Leu Arg Ser 
1480 1485 1490 



11633 



11678 



11723 



11768 
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age tec ctg ccg ate gtc teg ccg gtg tac cag ggc tat gtg gat 
Ser Ser Leu Pro lie Val Ser Pro Val Tyr Gin Gly Tyr Val Asp 
1495 1500 1505 



11813 



gga ggc ttg ttc gcc aac aac ccc acc atg gca gcc gtg gcc cag 
Gly Gly Leu Phe Ala Asn Asn Pro Thr Met Ala Ala Val Ala Gin 
1510 1515 1520 



11858 



gcg ate tac gcc aag gcg gcg gac ctg egg gac etc etc gtg ttc 
Ala lie Tyr Ala Lys Ala Ala Asp Leu Arg Asp Leu Leu Val Phe 
1525 1530 1535 



11903 



tec cag ggg gcg ggg gac atg gtg cac tac ctg aag ggc tac aac 
Ser Gin Gly Ala Gly Asp Met Val His Tyr Leu Lys Gly Tyr Asn 
1540 1545 1550 



11948 



gag aac tgg ggc tgg tgg aag tgg ctg ttg aac teg aag ttg ccc 
Glu Asn Trp Gly Trp Trp Lys Trp Leu Leu Asn Ser Lys Leu Pro 
1555 1560 1565 



11993 



tgg gcc ttg gtg gac ate gcc gta gcg gcg gga gtg gag gcc ate 
Trp Ala Leu Val Asp lie Ala Val Ala Ala Gly Val Glu Ala lie 
1570 1575 1580 



12038 



gac ttc cag gcg aag agg ctg ctg ccg cga ggc aat tat tgg aga 
Asp Phe Gin Ala Lys Arg Leu Leu Pro Arg Gly Asn Tyr Trp Arg 
1585 1590 1595 



12083 



gag gat ccg egg gtg ccc tct cat ctg ggc aac age gtg gaa gcg 
Glu Asp Pro Arg Val Pro Ser His Leu Gly Asn Ser Val Glu Ala 
1600 1605 1610 



12128 



cag gtg gtc acc ttg gat cag ate gtc gaa aag cac gat ctg age 
Gin Val Val Thr Leu Asp Gin lie Val Glu Lys His Asp Leu Ser 
1615 1620 1625 



12173 



gag gcg ctg gag tgg gtg gac age teg ggc tgg ctg cca aag ggc 
Glu Ala Leu Glu Trp Val Asp Ser Ser Gly Trp Leu Pro Lys Gly 
1630 1635 1640 



12218 



atg cat gag gca gcc tct ggc gcg cag ccc gaa etc age ggc gag 
Met His Glu Ala Ala Ser Gly Ala Gin Pro Glu Leu Ser Gly Glu 
1645 1650 1655 



12263 



teg gga gag gcg egg age tga gcagggcccg gagctgetet gccacccgct 
Ser Gly Glu Ala Arg Ser 
1660 



12314 



ecaegtgagg 


tggccgcagc 


agegtcatat 


ggtctcccgg 


cgcctcgcgc acatccacgc 


12374 


ccgaggcgct 


gagggctccc 


cagcccaggt 


ctcgcgcgaa 


gctccaggcc tgeteggagg 


12434 


t geggaagag 


gaegatgege 


tggeggagge 


gaectegggg 


ccgataggag agcatggctc 


12494 


gctggtgagc 


ccgggtcagc 


ctcaacatgg 


gggcaatgee 


aggctctcgc agcagcagca 


12554 


ijC^ogt^ag to 


ctcaggcacc 


acccgggcca 


tggcggcatg 


gtccagcagt tcctgcacca 


12614 


accct ccttt 


ccgcaggcgc 


tcccatcctc 


gggtcaggga 


getgetcagg' cgcgaggcag 


12674 


cgirctgga t L 


cagaggagcg 


cgagccgcca 


ggtagagata 


gtcgagcgta tagggegeca 


12734 
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ggccgcgcag caccgagccg ctgaagaagc gggccgtcgc ccagagactg gctcgctggc 127 94 

ccgaaggggc agggaagtcc agcaggctca gcagcgccac ctgctctcca gccgcctcga 12854 
gctgtagggc aacctcataa gccaccacgc agccgaagga ccagcctccc aggcggtagg 12914 
gccctcgcgg ctgcagctct ctcagctcgc gcacgtacag ggccgccagt tcctccacct 12974 
tgtccatggg ctcctgcccc tcctgcaacc ctggcggctg gagagcatag aaggggtggg 13034 
tgggcgccag gcagcgggcc agcgtgaagt agggaaacac ggtgcccatc agcggcgcag 13094 
cccagaagaa gggagggctc tcgccctgag gttggatggg caccaggtgg cgccacttcc 13154 
gagcgggttc gctggagaca gccggtgacg gctggggagc ggcggctggg attgccccgg 13214 
tggccgaggc aggctcgtcc gttgccgggg ctggcatggc ggagagcagt gtgcgcagcg 13274 
tagggcgggc gaggaaggag gagagtggca attccacctc cagctccttg cgcaaccgcc 13334 
ccatgagctg gatggcgagc agcgagtccc ctttcagctc gaagaagtca tcatccaacc 13394 
ccacctcgga gacgccgagc aggtcctgcc agagcgcggc cagggcgcgc tcgcgctcgg 13454 
tgcgaggagc aacatagggc ttggagcgtt cggtcctggc gggggagccg cggggcaccg 13514 
agggaagcgc cggctcccag gaggagacgc gcaggcgctc gatctccgcc atgcgcgcgc 13574 
gcagctcacg ggtggagacg gccacctgcg gcaggccggc cgcgaggatg cgctcgaagg 13634 
cctcgcagcc ctcggcggag cgcagcccga caatcgccac gcctgcccgg ggttgcactg 13694 
gccttgagga gctcgggagc gctggcggcc gtgacaaggg ctcttccggg gtgagcacca 13754 
ccttgcctac gtgcctgccc ttggagacga gctggagcgc ctcggccacg cgctccagtg 13814 
gccatgccgt gtagggcagc ggctccagct ctccgcgctc gatctgccgc agggcctcgc 13874 
ccatcagctc ccggtagccc ttcatccccg ggtctatctg cgccgcgagg aaggtgagcc 13934 
cgcgctcgaa gggagccagc ggcagcgtgc cgccagagag catgtccctt acacccagct 13994 
cgacgaagcg cccgtggcga gcgagcacgc ccaggctgcg ggagaggaac tcgccgctca 14054 
gcgagttgag caccacgtcc acgcctcgcc cctgggtgtg ttgcagaagc tggtcgtcaa 14114 
acccgggcgc gcgcgagtcc atggccagcg ttacccccag ggagcgcagc. agctcgcgct 14174 
tcggctcgct gcccgcggtg gcgtacaccg tggcgccccg ccgcagcgcc aacttcaccc 14234 
ccgccagtcc cacccccgtg gccgcggcgt ggatgaggac gtgctctccc gcgcgcagcc 14294 
gcgccagctc ctccagggca taccaggcgg tgaacagagt gacgggcacc atcgccgcct 14354 
gctcgaagct caggcccgct ggcttgggga agacctgctc ctgggggact cgcacatgcg 14414 
aggcgaaggc tcctgctgcc gcggccacca ccgcctcgcc cactcgcagg cctctcactc 14474 
ctggacctac cgcggagatg atgcccgcgc actcgactcc cagcgccagt ggggcgccgt 14534 
tggccagctc cactggcaac gcccccagcg ccagcagcgc gtccttgaaa ttgaggcccg 14594 
cggcgagcac ctgcacctcc acctctccct cctgcggcgg aaggcgctcc aggggttgca 14 654 
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gtcgcagcga gtcgagcgcg cctggtcgtg gaagcactgc ctggcgcgga gccttgccag 14714 

cgggctgtgg ccgagccaac gagtccacct ccacccggcg gagcacgaac tcctccacgc 14774 

acagcagctc ccgcccttcg ccgtccatga ggcgcacctc cagccgcacc actccggcct 14 834 

gggcactggc gggctccagc cggcggatgt ggctgtacag ctgccgcggc agcgggccgt 14894 

ggatgcgggt gttgccatag gacagcggca gccaggcacc ggggggcagc ggtgcaaacc 14 954 

cggtcgccgc gtccagcagg gcagggtgca ggggccattg ccgcagatcc ' tcggccagct 15014 

cctcggggag ctcgatcaag gccaggccct cttctccctt ccacgcgcgg tgccagctca 15074 

gcgaggacca gcgccggccc agggcaggtg ccagctcgcg ggggccctga gctggctcat 15134 

gctccctcgg cgcctgggtc aagccgagcc gttcctccag ctcccgtagc gggtgggtgg 15194 

gtcgagcgct cccgcgaggc agcatcctca cccggcccat cgcatggctg cgccagcccc 15254 

cattcgcgag gccgccagcc cggctgcgga tctccagcgc atagccctca tcctccggtc 15314 

gcaggtggaa tcgcacctcc acctcctccc ccgcgggcgc ttccagcaga gagagcagca 15374 

ccagttgctc cagctccacg ggggcctcgc ccagttcctg gaccagggcg gctcgcacca 15434 

gctccaggta cgccaccccc ggcagcgtgg cccgtccttc caggcggtgc tcgtccagcc 154 94 

accagcagcg ctcggcctgc catcgcaccg tgtactctcc gggtgccgtg cgctgcagcc 15554 

ccagtgctcc cgccgccagc cccagcccca ggcgcatggc ggcgcccacc tccagccagc 15614 

gatcccagtc catggagatg gtgcggcggt ccgtgcgccc tcgctgtgag tgcgcatacg 15674 

cgtcgaggaa ggcactggcg gcggcatagt ccgcctgccc cagcgcgccg agcaatgcgc 15734 

tcagcgagga gcacaggacg aagaagtcga gtggctcgtc gcgcagcagt tcgtccagca 15794 

ccagggtgcc ctcgaccttg gaggccagca cgcgcccggc cgactcgcgc gtcttgagct 15854 

gcatcagccc atcctccagc accagcgcgg cgtggatcac cccatggagc gccccatggc 15914 

ggcgcttcac ctcggcgagc acctcgccca tgcgctcccg atcggtgacg tccgccacca 15974 

ggggcagcac ctcgcctccc gcttcttcaa tggcccgcag ccggcggatc cgctcctgca 16034 

tggccgcgcg cccgcccact ggctgaagcc cgctctcgcc gggccgctgg gcaatgagga 16094 

gcacgttgtc ccagcgtgag cgctgctcca cggccgctgg cagcacctcc acctcggcaa 16154 

agcccacgtc gcgcagggcg tgctcccagt cgcctacctc cagcaaggga gagcgggtgc 16214 

gcagctcgtc ctcgtagctc caccacccct cggccaaacc ccagatcatg tccacccagg 16274 

gctgctgctt caccgtctcc accaggcaca ggtggccgcc tgggacgagc aggctctcga 16334 

ggtgggccag ggactgagga acgcgcgggg tggcgtgcac cacgttgagg gcaatcacca 16394 

gatcatacga gccaggcgga tggccctgct ctgggggagg gcgggagatg tccagcaccg 16454 

aggtgtggag gaaggtgagg ccctggcggc ggccgactcc ctccatggcg cggacgaagg 16514 
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aggggccaat atccgtgaag tggtactcca ccgggtggtg ccgcagcagc ggcagcaggg 16574 

cctggagcac cacgccactg ccgccgccca cctcgaggat gcgcagggta cgcccctgcg 16634 
tgcgctcggc atggcgggcg aggaaatgcc gcagcagcgc cacgcactgc cccatgctgc 16694 
tccactcgac cccgcttcgt gcggagtcct octcggagcg gcccgagggg tagagcacac 16754 
tgagggccgg caggctccct cggagcacct cagggtagtg gcgcacacaa tgctcaagca 16814 
gctccagcag gcctgacacc tccgggtagc cttcgagcag tcgctcgtgc agtgctcgag 16874 
gtgtgggcac gtgggctggc tccacctgac cctcgagcat gccctctccc actcgaatca 16934 
gcttcgcctg ctccagggtg ccgatcatga aggagaagag gcgctcgaag ccgggctgca 16994 
gccccaggcg ctcacgcaat gcctgcatcg ccctcggctc tccaagtctc agaggcgcct 17054 
gaagagggaa gagaaagtgg tagaggtagc tggcgcacag caggcgcagc gactcttcca 17114 
ggccgggact gtcggccagg gagtgcaggg gccgctggtg ctcgatctgc tcggcggcac 17174 
cgcgcagcca ctcgagctcc tgtgagagcc gcagccgcgt gggagtgccc. agccatgcgc 17234 
tccagctctc gcgcggtggc gtgggcgagc ggcccgtgag caccagccgc gccgacacgc 17294 
gccgcgccag gtgctctgcc agcgcgagcc ctacgccccc caggccgccg acaatcaaat 17354 
acacccctct ctgccgaagc gggagctgcg aaggctcggg cgggggcagc ggcacgggga 17414 
cgaactcgcg cacccagcga gccgcgccgc ggtacgccac cgagcgttcg gacgcggtgg 17474 
actccacctc cgccaccagc cgctccagca gctcctccgc cacgccgcct ggcggcaggg 17534 
ccatgtccac gaggcggcac gtgcacccgg ggagttcctg gggaagcacg gaagcagggc 17594 
cctgcaggag cgccttgagc ggctgtatcg gctcgccctc tgccaccgcg tagagctgat 17654 
cggtcaccac cgtgagctcc accgcgggcc gggtgccgtg ggctcccagt gcctgggcca 17714 
gcgccagcag ggtgtagaag ccgcgctcct gggggctctg gctgctcccc agcagccaga 17774 
ggtggagcac cctcgagggc agctctccgg agtcctccag caggcgttgg aagtcctcgc 17834 
ggcggcccgg gggcaggcgc cagtgctccg gggtcaggcg ctcgagccgc tcacctggct 17894 
cgacgagggt gacggagtgt ccccgggccc ggagccgctc cgcgagcccc ■ tggctccacc 17954 
catgggcctc cgccaacaac agcagccggc ccacccgtct cgagggggcc gccgagggca 18014 
gagccgcgcg gctccaacct ggcacgtaga agctgctggt gacagggctc tcggcaggtg 18074 
tcccaggccg ctcgggaggg gaaggctggg gggctgtctc cacccagtag cgctggtgct 18134 
caaaagggta ggtgggtagc accgcgcgcc tccggcgttc gccgcggtgg agcccagccc 18194 
agtccacctc ggctccctcc acccacagtc gcccgattgc cgtcagcagc cgctccacct 18254 
ccgagctttc ctcccggctg gagcccagcg tggacaccac ctgcacgggg ccggagcttt 18314 
ggcgcgccgc cagccgcccc agtgtgtggc ccggccccac ttccaacagc agccgcggct 18374 
gcacaccctc gcacaaccgc tccagcccct gggcaaagcg caccggctgc agcaggtgcc 18434 
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ggccccagta ctccggtgag gtcgcctcct cggcctctat ccaggtgcca gtcacattgg 18494 

agatgaaggg caggctcgga ggccgcagat gcatccggcg cagccgggcg acgaatgggg 18554 

ccaccgccgg ctccatcatc ggcgagtgga aggcgtgaga gacagccaga ggcgtgcacg 18614 

cgatgccccg ctgctcgagt cgtgcgcgca gctcggccag gggctcggtg gggcccgcca 18674 

ccacacaggc ctgggggctg ttgtgcgcgg ccagggccag tgccgcgggt aggagcggtt 18734 

ccacttcctg ggctgggagc cccacggcga gcatgccccc cgggggcatg ctgcccatga 18794 

gctgtccacg tgcggccacc acctccagcg cctcttccag ggagaacacg cccgcgaggc 18854 

acgcggccac atattcgccc aggctgtggc ccaacagcgc ggtggggcgt atcccccagg 18914 

cctccagcag tcgggccagc gcgtactcca cggtgaagag caccggctgg catacccgcg 18974 

tctcccgcag gcgctcctgg gcccattccc gctcccgcgg gccagggtac agcaggcggc 19034 

gcacgtccaa cccgagcacg ccctggagtt tcccggcaca gagatccacc tgttcacgga 19094 

agggcgcctc ggactcatgg agcgccgctc ccatgcccac gtactgtgag ccctggcccg 19154 

gtagaaggaa ggccacggag cgggcctggg ccgcactctg gccacgcagc agcgcctcgg 19214 

gtcggcgcct ctccagcagc tctaccgcct gggcagggct gctgctcacc agcgctgtgc 19274 

gccaggccat gtggcggcgg cctacctgga gcgtgtgggc cacctcctcc agcggctcct 19334 

ggggctgctg gcgcaggtgc tcgcgcagcc gggcctcgac ctcctcccgg gcgcgctccg 19394 

tgcgtgctga cagcggcagg agctggtagg gccgcgctgc ccgccgcggc gccggccggg 19454 

gcggcgcctc ctccagcacc acatgcacgt tggtgccacc aataccgaag gagctgacgc 19514 

ccgcgcgccg cgggccgtcc accggctccc acggctgccc ccgcccgctc acgtagaagg 19574 

ggctgcgctc gaagtcgatg cgtgggttgg gggaggtgaa gtgcgggcat ggcgggagct 19634 

ggcggcgctc cagcaccagc acggccttga tcaggctggc gatgcccgct gccgcatcca 19694 

ggtgtccgat gttggccttg accgagccga gagcgcagta gccgcggcgc tccgtgtgct 19754 

tgcggaaggc ccgcgtgagg gcctggacct cgatcggatc tcccagggcc gtggcggtgc 19814 

cgtgtgcctc cacgtagccg atgctgtccg cggagacggc cgccacctcc agcgcctcgg 19874 

agatgaccga ggcctgcccc tccacgcttg gcgcggtgta gccgatcttg tgtgccccgt 19934 

,nrr.gttcac cgccgagccg cggatgacgg cgtggatggt gtctccgtcc gccagcgcct 19994 

cctcgagcag cttgagcagc accaccccca cgccgttgcc tcggttggtg cccgccgccg 20054 

ccgcgtcgaa ggggcggcag tacccgtccg gtgaggcaat gcctccctcc tggtagaggt 20114 

agccctgggc ctggggcacg gagagagagg agccgcccgc gagcgccagg gcgcactcac 20174 

caczgagcag gctctggcac gcgaggtgga cggccaccag cgagctggag catgccgtct 20234 

gcaccgccac gctgggcccg cgcaggccca gcttgtacga gacgcgggtg ggcaggaagt 20294 
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ccttgtcgtt ggcgatggcg agctggaagt ctcccagcac ctccctcagc cgggcgttcc 20354 
cgtgcaggga atgcaggtag tagctgttga ggctggcgct ggcgaagacc cccacgcgcc 20414 
cccttgcgcg ctcggagcca tggccggccc gctccagcgc ctcgtgcgcg cactccagga 20474 
agagccggtg ctgtgggtcg agcacctccg cctctcgcgg ggagaagcca aagaacgcgg 20534 
cgtcgaactg gtccatcctg tccagcaggg cggcgcgccg cacgtaaccg ggttggcgca 20594 
ggagcgaggg ccccactccg gaggccagga gctgttcgtc cgtgagctgg gtaaaggcgt 20654 
ccacgccctc gtggatgatg cgccagtagt cctccagggt gggggcctgg gggaagcgcc 20714 
ccgccatggc cactaccgct acggctccct cgagctgcgt gtcgtggctc atgcgtgtct 20774 
agtcctcgtc ccgttgtgcc tggcggcgag tgcgtctctg ctccagcgcc gcgccgcgca 20834 
tcccccgccg gcgctcggcc tgcgtgggag gggaggcctg tgctggcagc tgggcggctc 20894 
gaggctccag gtagtccgcc agcgagtgca ccgtggggta ggcgaacagg tccatgagcg 20954 
tcagctccag gccgagccgc gcgcgcagtg cccgctccac gcgcaccagg aggatggagt 21014 
tgccgcccag gtcgaagaag ctggcctcgg gcgcgatgtc ctggtgctcg agcacctccg 21074 
cccagacgga ggcgatgagc tgctccagcg ggcgcgaggg gctcgggacc ggagcgggtt 21134 
gcggcgccct cagctcacct gcttcgaagc gcgccctcag ctccgagcgg cgcagcttgc 21194 
ctccctccgt gcgtgggagc tggtgaggct cgagcgggat gagatgatcg atccgcaccc 21254 
ccacctgctc gagcacctgc tggcggatgc gcgacaggag tgaggcaggg ggttgcggag 21314 
cgagtccctc cgtgggcacg aagaagaccg ccagctcctc cctgctcccc tcgagcatcc 21374 
gcaccgccgc cgccgccgcc gaggagggcg ccactccctc cacctgctcc accgcggact 21434 
cgatctcgta gcaggagaag ttggtgccgt ggatgatgac cagatccttg gcgcggccgg 21494 
taatggtcag cgctccctcg gagaggaagc ccagatcgcc tgtctccagc cagccgtcgg 21554 
cggtgagcag cgccgccgag gccttcgggt ccccgtagta gccgggagag atcatctcgc 21614 
cgcgcacttg aatgcgcccc acgcactcct cggagagcag ctcgccgctc gcgtccacca 21674 
cccgcagggc ggtgccggca atgggggctc ctacatccat cagccgcagc gcggtgggcg 21734 
aagcgggtgc cgcgtgccga agcggaccgc tcaagctcgc gcgctccagg gtgtgcatgc 21794 
ggggcggcgt tccgggccgc cgcgcatagg tcacccccga agccgtctcg gtcattcccc 21854 
aggcagcgac gaaggcatcc tcccgcactc ccgagggagc caacctccgc atcagctcct 21914 
ccaccatggg cgcggagatc tgctctccgg cgctgagcag cacgcgcacg cctcccaggc 21974 
tccagcgccg gtcctccacc ttgcgcagcc gctccagcag gtggctgtag gcgaagctgg 22034 
gcgcccacgt ctgtgcaatg ccataccgct cgagccactc cagccatcgc aggggctcgg 22094 
cgagcacctc ctcggtggga gcgtggaagg tgtgcatgcc ggcgcagagc gggcggaggt 22154 
ggtactcgga cagggcgcca atgttgtgca gctgcaacca gacgaggccg cgctcgagcg 22214 
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gctgggactc gtagacgacg ttggtggcct ccaggcgcac cagcaggttg cggtgcgtga 22274 
gcatggcgca cttggccttg ccggtggtgc ccgaggtgag gttgagcagg gccacgtcat 22334 
ccatgcgccc tgggggcagg ggcgcctcgc cacccgtggc tctcagctcc tccacggccc 22394 
ccaggatgtc ggcggtgggc gagacctgaa gctgcctggc aagatcgggc agcatctcct 22454 
gccgggtgag gacccggggc ccacccagca gctgccgggc atgccgcaga cgtgagagcg 22514 
cgggactcgc ctccgctctc gcgggcggcg ccggtagcac caggggcgcc actcccgcca 22574 
gcacgcaggc ccagagtgcc tctaccgtct cttcctccct ccccagcacc agcaccagac 22634 
tgtcgcccgg ccgcaatccc cactgccgca agccggcctg caggcgccgg gcccgctcca 22694 
ccagctctgg cagagtccag gccgtgcaag agccctgtgc ctggacgaag ctgaccgtgg 22754 
ccgtggggaa gcgctcggcg gtctgtagaa gtgcctcggc cagtgtccga ggggctgtgc 22814 
tcgggcgtgg gaggggcgga cactccgaca gggcgagaac ctgcttccgt acgggcgggg 22874 
ccagcgtctc acccgccagc tcccccagtg tccgcgtcgg gtccgccgcc acggcttcga 22934 
gcaggctcac gtagcgccgc accagttgcg ctacgccctc ctggccatac aggtccacgt 22994 
cgtagatgag ggtgccctcg acgccctggg cagactccca gaggtggaac tccagatcga 23054 
agcgggccac gcggctgcga agggggaact ccgagagcag caggcctggc agctcggcgc 23114 
gccgcagcgg gtgctgctgc atggcgagga agatctggaa gagcgggtgc cgatccatgc 23174 
ggcgccgggg ctggagcacc tccaccaatc gttcgaaagg cacctcctgg tgctcgaggg 23234 
cgccgtggag ctgctcggac acctgggcga gccatgaggt gaaggcggtc gaggcggagg 23294 
gcttcgcgcg cagcaccagg gtactggcga agcagccgat gaggccttcc agctccttgc 23354 
ggttgcggtt ggcgatgggc atgcccaggg ccacgtcgtc ctggccggcg caccgggcca 23414 
gcagccattg gaaggcggcg agcagcacgg agaagagcga ggtgcgccgc gcttgagcaa 23474 
gtgagcgcag ggcctgcacc tggtgtggtg gcagggcgaa ttcgaacgtg gcgccctcct 23534 
gcttggacag cgggggcctg ggccgctctg gcggcagggt gctcaggtcc ggcagccctg 23594 
ccagctgctc ctgccagtac tcgagcagct cgcgctcgcg cccacccttg agccggctcc 23 654 
gctcccacgc ggcgaagtcc gcgtactgca gccgcagcgg tggcagcgcc ggtggctccg 23714 
cgtgtacagc agccgtgtac agcgcgcaca gctcctgcac gaagacgccg agcgaccagc 23774 
catccgctgc gatgtggtgc gtggcgaaca ccagcacgtg ttcctccggc tccactcgca 23834 
ccaggaggaa gcggcacggc ggctctcgct ccagatcaaa ggaggcgagc gctgcgtgct 23894 
".r ntrrsascs gtacacctcg gcctctcgct cctgggcggg cacgccgctc aggtcgcgct 23954 
gcccgagcgc agccggtgcg ggcgcgagga tgcgctgcag gggttgacct ccggcggagg 24014 
ggaacaccgt gcgcagcggc tcgtgacgcc ggacgaactc attcaggctg tgcaccaggc 24074 
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actccacctt cagctttccc gtgagccgga gcgccgcgtg cacgttgtag aaggggctcg 24134 

taggcctgag ctgttgcagg aaccacatgc gctgctgggc ggacgagagc ggcagcccct 24194 
cgggcttgcg ctccacgggc atcagcaatg cgccctcgtc caggccctgc tctttcatca 24254 
gggccgcgag tagctcccgc tgctccggcg ccaggctctc cagctcctcg tgcagcttac 24314 
tcatgtggtc tccccttgct gccgcggcgg gcctcgagca gcgctcgcgc ggcctcgggc 24374 
gtgagcgtct tcacctgttg caggagctgc gccagttgcc gcgcctccgt tcctccctca 24434 
gcgggcaccg aggccgcctc ctgcccccga gtggggagta ctccgcccac caggctgtgc 24494 
agcagcacgt gctcctcctc caagtggaag agcccccgcg cctgggcgaa gtccaggtcg 24554 
ccgatatggc acagccccag cccactggga gccgcgccca gatcgagaag ctgggtcatc 24614 
agccctgcct ccagcgtggc gaagtgcagg gcatgttcgg cgtataccgg ctccacggcc 24674 
cccatgcggg cgatgaggaa gagggagaac gcggccgagt cgaagatggc ctggttctgg 24734 
tgcggatcat ggatgcggcg atccagcccg gctccggcgg acaggagcac cagccgatgg 24794 
gtgctgggat cgtaatacca ggtgccgggc tcgagccctc gcgcccggcc gggcttgagg 24854 
tggaggtaga gctgcaccgg gtagagcccg cccgcggagg cgtagaggta acgccgcgag 24914 
ccctgcacct cccactcgcg caagggggcc agcagccgtc ccaactgctc atggctcacg 24974 
ggctcgaggg agtaggtgcg cacactgcgg cggcgctcga gagctgtcag ctcctcgggg 25034 
gccggcgcca agccgctgcc tggcagtgcc acctcaggcg ttccatcggg gaagcggcgc 25094 
aggcccgggc ggttggcctt gaagcgcccc cactcctcgg cgggcagaag cgtggagcgg 25154 
ggcgccgtga gccgcggagc gcctccctct ccctggctct ttgtcttctc ttggtggagc 25214 
gtggcgagcc cctgcacggt ggggttctcg tagatgtcgg ccagtcgcgg gcggaactgc 25274 
agctctgcct ccaggcggtt gatgaggcgg atgagcgcca ccgagtccgc ccccaggtcc 25334 
agcaggctgt cctggggctc cacgcgctcg agcctcagcg cctctttcac cagggctgcc 25394 
agccgctcca cgagcgggtc ggccgccgcg gcctgtgcag ccagtccctg tgtctgctgc 25454 
ggctcgggga gctgatctcg ggcgatcttc ccgttgcggc tgcggggcag ggactccagg 25514 
aggacgaagg cggagggcac catgtacgcg ggtagccgct ccgccaggta gcggcgcagc 25574 
tcgcctgccg cgggcgtctg acccgagcga ggaaccgcgt aggccaccag ccgccgcact 25634 
ccccgaggct ctcctcgtgc caccaccacg ctcgccgata gcgccgggtg cfcgggccagc 25694 
gccgcttcaa tctcccccag ctccacgcgg aagccctgca ccttgacctg cagatcctcc 25754 
cggccgagga actcgatgct cccttcgggc agcatgcgcc cttggtcccc ggtgcggtac 25814 
agccgctcgc ctgtgcgcgg gtgacggatg aagcgcgttg cggtgagcgg ctcatccctc 25874 
cagtactcgc gtgccagccc ctcgccgcca atgtacagct cgcccggcac ccaccagggc 25934 
cgtgcctcca gccgcccgtc cagcacgtgg aatcgctgat tggccagcgg catgccatag 25994 
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ggaatgctct tccactgcgg tgccacctgg ccaatggggt aggcaatgga ccagatggaa 2 6054 
gcctcggtgg cgcctcccag gctcaccacc tgcacgtccc tgccgagtgc gcggatgcga 2 6114 

tccggtagcg ccacggggat ccagtcgccg ctgagcatga ccaggcgcag cgccgcgggc 2 6174 
agcttcagcc cgcgctgctc ggcgtactcc accagcagca gcatcagcgc cggggtggag 26234 

ttccacaccg tgacccgccc agccaccagc cgctcccacc agtgggcggg atccttctcc 2 6294 

gcctccgccg cgggcagcac cagcgctccg cccgcgccga gcagccccag cacgtcgtac 26354 

accgacaggt cgaaggtgag cgcggacagg cccagcaccc gatcctccgg gccgacgcca 26414 

aagcgggtgt tgaggtccac cacggtgttg agcgcggccc ggtgttcaat ggccacaccc 26474 

ttgggtttgc cggtggagcc ggaggtatag atgacgtagg cgaggtgctc cggtgtaccg 2 6534 

cgaggcggta gcgggggggc ctcggttgca ggctcgagtt cgtccacggc gatcacctgc 2 6594 

acacccggcg gccagggcac ggtgtgcagc aaggaggact gggtgagcac cacgcgggcg 2 6654 

ggcccctcct ccaggagctg gtggagccgg agcggcggct gctctggatc caacggcagg 2 6714 

taggccgcgg ccgcctgcag cacgccgagc acggcggtgg cctgctccca gcccttgtgc 26774 

atggcaatgg ccaccaactc ctgaggctgc acctccagct cgcgtaggcg cgcggccagc 26834 

gcctgggctc gccttgccag ctcgccgtag ctcagggtgc gctccggtgc gagcagcgcg 2 6894 

gggagctcgg ggtggagccg cgcctgggtg aagaagccct cctccagccg tccgctgggc 2 6954 

cgcggcgcct gggtggcgtt gtagcgcgcg agcaactcac gctgggcagg aggcagcagc 27014 

tccg'gcagct cgccctccca cgcttgctcc tcctcggcga gtgcccccag cagccgctgg 27074 

taggcgtgga acatgtcgtc caccatgccc ggagggaaga gcgcctccac cgagtcccac 27134 

gccaggacga gcgccccctc ctcctcgtgg actccgtggt ccagccacac ctggggagtc 27194 

tggctgatgc tgtacaccag ttctccctcg aagaaggaga ggctgccctg ggggccgcgc 27254 

cgcgcatcca ggctgaggat gctggtgaag acgacgggca tgatggcgcc cggggagcgg 27314 

cggccggtgc ggacgagctc gcggatgagc tgcacggcgc tcacgctgcc gtgctccagg 27374 

tcccgccata gctgtgcctg gagccgggag gcacgctcgg cgaaggtgct cgccgcgtgt 27434 

gcctctacct ccagcaggac cagggaggtg aagtcgccca ccagctcgtc cacctgcggg 27494 

tgcaggggca agcgctggaa gagggtgagg ttgagggtga agcgcgggtg acggctccag 27554 

cgggcaagca cctcggcgaa ggcggccatg caggcggccg agggggtgag gccgtgggcg 27614 

cgcgcccgct cctggagccg ggcccagcgg tgtggctcga gccgagcctc gcggtgggtg 27674 

aaccgcgcgt gctccagctg cgagggttcc ttcaccaggg gcagctcggg cggtggcggc 27734 

agcgtgtcca gccgagccca ccagtatgcg cgggcgcgct catgggcctc gccctcgcgc 27794 

agccggcgct ccgccagcac gtagtcgcgg aaggtgatgg ccagcggctc cagcggccgg 27854 
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cccggctcgt ggtacagctg ggcgaagtcc tgccgaagca ccgctgaact ccacgcatcc 27914 

agcatcaggg catccatgct catgtggatg cggacgcctc cctcgtaccg gcagagcacc 27974 
agctcgaaga gcggccagcg gtcgctgcgc aggacctggt gggccatgcg ctcacgcagc 28034 
tggagccggc gggactcggc ctcctggggg gagagccccc gcagctccac cacctccggc 28094 
gtgtactccg gcacctgctc gagcacctgc tgccgcccat ccgggagcac caccatgcgc 28154 
agcatgtcgt ggcgctgcag gagctggcgc cagcattgga tgaagcgctc cacctccagc 28214 
cccgggcttt cgatttcgaa gtagccatgg gcggcgacgc cgccgagctc gaaggccgag 28274 
cggcgaccca cccagtaggc ttcctgcaca tccgtgaggg ggaaaggctc gaagcgcgcg 28334 
gctggatccg gcacgagcgc tggcgcgggg gcagtgggag ccgcggcctc accctgtcga 28394 
gcccgcagag ccgcggccag cgcggcgagg gtggggtggc tgaagagggt ggccgtgggc 28454 
agctccacac ccaggtgcct gcggagatcc gccgccactt ggatgcccag cagcgagtct 28514 
cctccaagct ggaagaagct gtccgtgggc tgcacgcgct ccacacccag gtgcttgcgc 28574 
cacacctgct cgatgagctg ctccagcgag gagggtggca aggcctgcgg ctcggcggcg 28634 
gcggcgcgcc cggtgacgag gcgctccttt tcgcgcacgc cctggctgcc gcccaggccg 28694 
cgcagcagcg agctgaagtc ctggggggag atgatgagct gcggtgcccc ttgctccaac 28754 
gcgcgcagga aggcctccgc accctcttca gaggagatgc cgcccgccag ccgcgggtcc ' 28814 
tcccaaggag gtacggccac cggtgcgggc ctggagcccc gagtcatgga cgcgggcaat 28874 
gccgtcgccc cctgcatgga gacggcgacg cgaccgatgt gcagcgcgcg ggccatgaag 28934 
ctgaaggctt cggccacctg ccgcgcgggg aagaggcggg tgggcaatgg ctcgagctgg 28994 
ccctgggtga gttgcgtggc cacctcctcg agcaccgctc ggaagtccgg gtggtgggga 29054 
ccgaagtcaa tggcggcgaa agtctgccct cgggccaggg ttcggaggcc cacctgctgg 29114 
tccg.cataca ggtccctctt ccccagctcc aggaagcggc cgtgcggggc cagcacggac 29174 
aggcccgcca ggagcagctc tcccgccagc gagttgagca ccacatccac cccacgcccg 29234 
ccggtgcgct cgcgcacttc gctgacgaag gaggtgctgc gcgagtccag cacgtgagca 29294 
atgccgagcg agcgcaggta ctcgcgcttc tgctcactgc ccgcggtggc caatatctcc 29354 
gccccggtcc gggatgcgag ctgaacggcg gccagaccca gccctccggc cgcggcgtgg 29414 
atgaggatgc gctcgccgcg ccgcaaccgg cccacggtgt gcagggcgaa gtacgccgtg 29474 
gcgaacggca ccatctgggc cgccccctcg gcgagcccca gcgaggcggg cctgcgcacc 29534 
acctggctct catccaccag cacgtaggag cggaagcagc ctggggccac cgccagcacc 29594 
tcgtccccca cgcgcagccc gctgacgccc tcgcccacgg cggcaatgcg tccggagcac 29654 
tcgcgcccca gtacgctctc ctccgcctcc aatgccggca tcatccccag cgcgcccagc 29714 
acgtccagga agttgagccc agcggcctcc acctcgatct ccacctggcg cggcccgggg 29774 
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gccttgagcg gctcgagcgg ctcccgcggg cccaccgcgt ggacgaagct ggacagcacc 31694 
tcgagctgta cagggcgctc aggcgcgtgg cggccgagcg cacgggccag ccgcagcagg 31754 
ctgaagaagc cctgttccag caccgcttcc ccgtccgggg tacaggggcc gggctccgcg 31814 
gtgagagccc aggcatggag gatgcgacgg gggagccggc cctgggactg gagcgcctgg . 31874 
aagagcgcgt cccagtcctc ctcgctctcg gggcggaggc ggaaggcgtg ctcactgagt 31934 
tgctcatagt gcccggccgg ctcgacgcgg gtgaggagcg cgccggagtc cgcgggccac 31994 
agcgcggcca gcacctgctc ggccaggccc gaggagtcca tgaaggcgag caccgggccc 32054 
gccaggggct gtcccccccc gctcgtggca ggtgcctgct cccaggtggg cacgtagaac 32114 
cagtcttcca gggcctcgcg cgagggctgc tccgggggca caagggacgc gggcgtcggc 32174 
tgctgcgcgc gcggctgcgg ggcagcctcc ggttcaatcc agtagcgctg gcgctcgaag 32234 
gggtagggtg gaaggtgtgc gcggtggcgt ggcgcggcgt gcagggcgct ccagtccagc 32294 
tccaggccgg cctgccacag ctcacccacg gcctcttcaa tgcacggcac ggccgtgtcc 32354 
ggttgcacgg gaaggctcgc cacctcctgg gtgcgggcgg ggcgggtggg gtgccgccgc 32414 
gcaagcgcgg tgagcghggt gccggggcct acctccagca gcactggctc cttgaggctc 32474 
cagagcgtct cgagggcctc ggagaaacgt accggctcgc ggagctggcg ggcccagtac 32534 
gtaggctccg tggcctcctc gccggtgagc caccggccgg tgaggccgga gacacagggg 32594 
aggcgagggg cctgcaggcg catgcgccgc agcagcgtgg tgagtggccc ctggcatgcc 32654 
tccatggagg cagagtggaa ggcatgggaa gtctccaggc gccggctgga cacgccccgg 32714 
ctctccagcg cggcggtgag cgcctctacc gccggtagag gacccgaggc cacgcaggcc 32774 
cgtggcccgt tcaccgcggc gatacacagc tcgctgccca gcagcgccgc cagctcggcc 32834 
tccggcagcg gcacggccag catggccccc ggtgggcatc cctgcatgag ctgtccccgg 32894 
gctgccacga gctgaagtgc atcttccagg gagaagacac ccgcgaggca cgccgccacc 32954 
agctcgccca ggctatggcc tgccagcgcc tgcggttgga gcccccagcc cagccaggtc 33 014 
tgcgccaggg catactccag ggtgaagagc gcgggctggg ccacccgcgt ctgcgccgcc 33074 
agccggcgtg cctgctcctc ctgctccgcg gaggggaaga gcacctccct gacatccagc 33134 
cccagcgtct gcctcaggag ccgggcacac tgctccaccg cctggcggaa ggtgggctcg 33194 
gattcatgga gcgcacgggc cgtgccgacc agcggagtgc cctgccctgg aaagagcatg 33254 
acgagagggg gccgctctcc ctcgtgagca agcggctcgg gtggccgcgg ctcgggagcc 33314 
tccaggcgct ccatggcctc ggccagggtg cgacaggtga gggcgcgccg gtaggggaag 33374 
cgggctcgcc ccacctggag ggtaaaggcc acgtccgcca gttccaggtc cgggtggcgg 33434 
cgcaggtgtg ccgcgagccg tcctgcgatg tgctcgagcg cgctggtgct tcgggccgag 33494 
agcagcagca cctggttggg acgggtgggg ccgctggcgg gcggcggcgg agcctchtcg 33554 
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aagagcgtgt gggcgttggt gccgccaatg ccaaaggagc tcacccccgc cagccgaggc 33 614 
ccgcgcgggc tctcccacgg gagcggctgg gtgttgacga agaaggggct ctgctccagg 33674 
tggagcgcgg gattgggccg ctcgaagtgg agcgtgggag ggatgagccg gtggcgcagg 33734 
gcctgcaccg tcttgatcac cccggccact cctgccgccg catccaggtg tccgatgttg 33794 
gatttgaccg caccgagggt acagaaggcg gtgtcagcgg tatgcgcccg gaaggcgcgg 33854 
gtgagcgccg ccacttcgat cgggtctccc aggggagtgc cggtgccgtg agcctccacg 33914 
taggtgatgt cgcgtgcgct gatgcccgcc gcggcgtgcg ccagggcgat cacctcggat 33974 
tggccctcca cgctgggcgc ggtgaagcca gcccgggagg cgccgtcgtt gttgagcgcc 34034 
cagccgcgga tcaccgcgtg gatgggatcc ctggccttca gcgcgtcctc caggcgcttg 34094 
agcaccacca cgcccacgcc atcccccgtg acggtgccgg tagcacgcgc gtcgaagggg 34154 
cggcagtgcc cgtccttcga ggcgactccc ccctcctcgt agaggtagcc ggagcgcgcg 34214 
ggcagccgca gcgagacgcc gcccgcgagc gccatgtcac actcgaagcc gcgcagcgcg 34274 
ctgcacgcca tgttgatgga gacgagcgag gtggagcacg cggtgtggat gccgagcgcg 34334 
gggccgcgca gattgagctt gtaggccacg cgcgtggcga gaaagtccgg gtagttgcca 34394 
aagaagcccg ccacatcctt ggagccgctg agcgggtccc cgtgtccgcg cagcagctgg 34454 
aagcggtagc tggtgtcccc ggcgcccacg tagacgccca cccagcctgg gagctgatcc 34514 
gggcgaaggc ccgcatcctc cagcgcctcc cacgagcact ccagcagcag gcgttgctgc 34574 
ggatccaggt ggctcgcctc gcgcggcgag tagccgaaga agcccgcgtc gaacagctcc 34634 
aggtcttcca gcacaccctt ggcgggcacg tactcggggc gccgacgtat ctgctcggac 34694 
acgcccgcct gtctcagctc ttcctcggag aagaaggaaa tggactccac tccggccacc 34754 
agattgcgcc agaaggactc cacgtccggg gcccccggga agcggccggc catgcctacc 34814 
agggcgatgg aagactgcag cgcttcatcc ccactaggtg tcattgggag ctccccgctt 34874 
cgaaccggcc cggcggcgtt gggctgcctg ttgctggagc aaggcgcgct ccgtggccat 34934 
gcgctgcacc tgcgcctcgg cggctgccag ctcctggggg gcctgcaggt gctcggccag 34994 
ggcagcaatg gaggagtagc ggaagagggt caccagttcg agccgccggc ccagtcgggc 35054 
ctccagccgg ctgcacacct gcgccagcag cagcgagtgc cctcccacgt cgaagaagtt 35114 
gtcctcccgg ctcacttcct ccagtcccag cacttccttc cacacctgcg ccagcacccg 35174 
ctccagctcg ttgctggggg gcaccagcgg ggcagattcc agccgacgcg agtcggggac 35234 
ggg—^crcc gcgcggtcca ccttgccatt gggagtgagc ggcagggact gcagcaccac 35294 
gaagcgtgag ggcaccatga agtctggtag atgggtctgc aggtgggccc gcagcgcgga 35354 
gcccsgcctc agtgagaggc tggcccgccg aggctccgag gacagacgcg cggaggctgg 35414 
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gggcggcgtg 


cggttgagcg 


gcgtggggcc 


cgggaggaag 


agcacgtcga 


aggccccgtc 


cgtgtagccg 


agcgactcgg 


ccagttgcca 


gggttgggac 


aggcgccggc 


gcagctcctc 


cgcctcggcg 


gcccggaccg 


cgggcagcac 


tggccctggc 


gggccctgcc 


ggagccgagc 


ggaccagtcc 


acccacggca 


cctcggaggc 


cacggcgttg 


tagcggaagc 


gtgccatctc 


atccacgtgg 


ctcacccgtg 


gatgctgatg 


caccagctcg 


ttgtcctcca 


gcaccgcacg 


catggagggg 


gcggcccgct 


ccagctccac 


gctccgcaca 


tcccccacga 


agacgaagcc 


cagcacctgc 


cgcaggtact 


cctgcgaggg 


cgcgtcgaag 


gagccggccg 


ccacgccgct 


cacgcggccc 


gggtgctccg 


gggggaggta 


gaaatccgtg 


cccacgtagc 


gctcgcagtg 


accacagccc 


acttcgagga 


tccgtcgtgg 


gcgcagccag 


tccctcatct 


cctccacggc 


gccgacgatg 


ttgaaggtcg 


gctctccgcc 


gaacacggac 


tcccactcgt 


gtaccagccg 


ctccacccgc 


ggcacggcgt 


agcagaccag 


caccaccgcg 


gactgcaccg 


ccgggtgggt 


gatgcggaag 


ccgcgtacct 


tgagctggtg 


atcctccctg 


cgccgcacca 


ggtctcccgt 


gagcggatcg 


gggatgaagc 


gctcggcggt 


cagcccgggg 


ccgcccacga 


gcagctcgcc 


ctcgtccacc 


acgctcaccc 


agcccaggtc 


cagctcctgg 


gggctgccca 


cgcggtgaca 


gtagccgttg 


atgagctggc 


actcggggag 


gggcagcacg 


tcccctccgg 


cgagcagctg 


ctccaccatg 


agccggaaga 


gcccggcggt 


cagctcctgt 


cccaggctct 


cgagcgactg 
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gagccaccgc 


gggtgcatcc 


ctggcgtcgc 


35474 


cctccgccca 


ggagaccagc 


tcactgccgc 


35534 


gaaggggtct 


ggatcctggg 


cccctgtgcc 


35594 


cacccgccgc 


gcgctgcctg 


tgctgcccag 


35654 


tcgcgcattg 


ggaatgcctg 


ccacccccag 


35714 


ccgcagtgca 


tcgaggctca 


gtccttgcgt 


35774 


cggtggcggc 


gtgcgcggcc 


cgatgtggag 


35834 


attcgggtgc 


gtgccgcgcg 


tcagctcgat 


35894 


ggccagcgcc 


acgaagagtg 


ccgggtccac 


35954 


ccgcacccgc 


tcgcgccacg 


cctccagcgg 


36014 


cgaggcgtgg 


aaggactcca 


gcagcggcaa 


36074 


cccgtcctcc 


acggcctcca 


cgcagcgggc 


36134 


gaagtactgc 


accaccgaat 


tgagcagcac 


36194 


ccactcgtcc 


gccgtgcggt 


gtagcagctc 


36254 


gcgacgcacg 


tagtccaacg 


ccgccgggga 


36314 


cggcagcagg 


gcaaagagca 


tcagccccgt 


36374 


cctgagcccg 


cgcacgcgct 


ccacccggtg 


36434 


cacgggttgg 


ccggtgtagc 


tgctcttcca 


36494 


cacgggggct 


tcgcggtaca 


tgtgcccgtc 


36554 


ctgctcctgc 


tgcgagccct 


gctccgagct 


36614 


ctccttgccg 


ccggggccct 


cgcgccccac 


36674 


gagcaccgcg 


gcctccacct 


cgcccggctc 


36734 


gtccacgcgg 


cccaggaact 


ccagcgtccc 


36794 


ccgatagacg 


cgagcccccg 


gcgtccggct 


36854 


gagctccggg 


tgttgcaggt 


acccccaagc 


36914 


cggggctccg 


tccggcacgg 


gctggagccg 


36974 


gatgggggtg 


ccgatgggca 


cggagccgcc 


37034 


gcaggtgaag 


gtgcagctct 


cggtggggcc 


37094 


ggcctcgcgc 


agccggcgta 


cctgcggcat 


37154 


gcgcacgccg 


c crc aocr c t c t 


fccracrc terete 


37214 


gagccacagc 


acggtgacgc 


cctgggtgct 


37274 


ctgttccgga 


gggaccagca 


cgagccttgc 


37334 
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cccgttgagc 


agcgcccccc 


agacctcgaa ggtggaggca 


tcgaactcca 


gcggtgcgta 


37394 


gtgcagcatg acctcccgcg gcgagagctg gacgtaggag ggctcatgga tcagccgcag 


37454 


cacggcacgg 


tggggaatgc 


agacgccctt gggctcgccg gtggagcccg aggtgaaaag 


37514 


cacgtaggcg 


agcatgtccc 


cgcccagcga gcgctccggc 


ccgagcgcgg 


ggacctgctg 


37574 


cgcggaggcc 


ggctcctggg 


cggcatcgac cagggtcagt 


ggctgcccca 


ggcctgacag 


37634 


caccgtgcga 


gcctgctcct 


cggcgatgac ggtgcgcacg 


cccgcggcct 


gcaccagcag 


37694 


gctcagccga 


cgcgctggct 


ggcgcacatc cagcggtacg 


taggcggctc 


ccgccttgag 


37754 


gatgcctagc 


aggctgatga 


tccgctccag cgtgcgctcg gtggccaacc 


ccactctgtc 


37814 


gcccgcgcgc 


acgcccagcc 


aggccaggta gcgggccagg 


cgttccgagc 


gctcctccag 


37874 


ctctgcatag 


gtgaggtgcc 


gcccgccttg ggtgacggcc 


gtggcgcctg 


gctgccgccg 


37934 


cgccacctcc 


gagaagcggg 


cgccgatgct tgtttcgatc 


gccgcggcgg 


gggagggcgg 


37994 


cggaacctgt 


cgcagctccc 


gggcaggcac caggggcagc 


tccgccacgc 


ggcgcgccgg 


38054 


gagctcggcc 


acgccctcca 


gcagcgtgca ccaggcgccc 


accagccgct 


ccaccgtccc 


38114 


cgcgccgtac 


agcgcgcggt 


catacacggc gatgcagctc 


agcccctctt 


cctcctccca 


38174 


caggtgcagc 


tcgaggtcga 


agcgcgtggt ggccaccagg 


tactccaggt 


gctcggcctc 


38234 


caggcccgcc 


agccgcaacg 


gctcgcgggg cgcgttctgc 


agggcgagca 


gcacctgcac 


38294 


cagcgggttc 


tggttggcct 


ggcgctcggg ctgcaactcc 


tcgacgagcc 


tctcgaaagg 


38354 


cagctcctgg 


tgggcgtagg 


cctctagcgt tgtctccctc 


acccgcctca 


acagctcgag 


38414 


gaaggagggg 


ttgccttcca 


ggcgggtgcg cagcaccagg 


gtgttgacga 


agaagccgat 


38474 


gaggccctcc 


acctccccac 


gtgtgcggtt ggcgatgggc 


gcgccaatgg 


cgatgtccgt 


38534 


ctgcccggtg 


tagcgcgcca 


gcagcgcatt gaagccggcg 


agcaacacca 


tgaagagtgt 


38594 


ggccccgtgc 


tcatgcccga 


gccgccgcag cgcctgcacc 


cgtacgcccg 


ggagcgtgaa 


38654 


gcggtacgtc 


gcgccgttga 


agcgctgcac ttccgggcgg gcatggtccg 


cgggcaggcg 


38714 


cagcggggcc 


aggcgcgcga 


gccgctcccg ccaccaggtc 


agctcctccc 


gcagcaccgg 


38774 


cccttgcaac 


cagccgcgct 


gccacagcgc gaagtccgcg 


tactgcacgg gcagcggcgc 


38834 


caaccccggc 


tcagcgccat 


ggagcgccgc ctcgtacagg 


tgcgacagct 


ctcggctcca 


38894 


aacgctcagc 


gaccagccat 


cgcccacgat atggtgcagc 


tcgaggaaga 


gcagggactg 


38954 


tgcctcttcc 


aagcgcagca 


gggacacgcg cagcagcggc 


cccttgtcca 


gctccagtgg 




ccggagcgcc 


tgctcgcgca 


tccaccgcaa ggcctcctcc 


cgcgcggcga 


tgccatggga 


39074 


gcgcaggtcc 


acctgccgca 


gcgggaccgg agccggcgcg 


tggatgcgct 


gccgaggcac 


39134 


gccgtcgccg 


ccaggctcga 


aagtggtgcg cagcgcctca 


tgccggcgca 


cgagctgctc 


39194 
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cagggcgcgc 


tccagggcgc 


ccacgtccac 


ccggccccgc 


acatgccacg 


catccaggat 


ctggtaggcg 


gtgctttgtg 


gcgcgagctg 


ctggaggaac 


cacagccgct 


gctgggcgaa 


ggagaggggc 


agctctggtg 


gacgtacagc 


gggcacgagc 


ggaggggggc 


ggagctgccg 


gtggccgagc 


aggacctcca 


cctgggccgc 


cagcgccgag 


agctgagggg 


cctcgaagag 


ggtgcgcagc 


ggcaactcca 


cctgcagcgc 


ctctcgcaag 


cgggagacaa 


cccgcgtggc 


cagcagggag 


tgtccgccca 


ggtcgaagaa 


gtcgtcgtgc 


acggataccc 


gttgggtgcc 


gagcacctcc 


ctccagatgt 


ccaccagcag 


cgcctccacg 


gggcctcgtg 


gaggtaccac 


cggagcctgc 


tcgcgggagg 


cctcctcgaa 


gctctcgggt 


agcgccgagc 


gatccacctt 


gccgttgggg 


gtgaggggca 


gtgcctccag 


cagggagaag 


ctggcgggca 


gcatgtactc 


gggcagtcgg 


gtgcggagga 


aggtgcgcag 


ctcctcggca 


cctgggcggc 


caccggcgcg 


ggggacgaca 


taggcgacca 


gccgccgcag 


ggccccttcg 


ccccgcagct 


ccaccaccgc 


cgcggacacc 


tccgagtgct 


cgcacaggtg 


cgactcgatc 


tccgccggct 


ccacgcggaa 


gcctcgcagc 


ttgacctgcc 


tgtccacgcg 


gccagcgaac 


accacggccc 


catccgcctg 


ccgccgcgcc 


agatctcccg 


tgcggtagag 


ccggcctccg 


ggctccaggc 


cgtgcggatc 


cggcacgaag 


cgctccgcgg 


tggcctcggg 


gtggcgccag 


tagcccagcg 


ccagcccatc 


acctcccagg 


tatacctcgc 


ccatggcgcc 


caccggcacc 


ggctgcagcg 


cttcgtccag 


cacgtgcagc 


cgcgagttgg 


cgatgggatg 


acccatgggc 


acggtatgcg 


cctgctcggc 


caggtgttcc 


acccgccagg 


cggtggagaa 


ggtggtgttc 


tccgtggggc 


cgtacacgtg 


tagaaggtgt 


cctggagcgc 


cctgtttcag 


cacccgcctc 


acactggagg 


catccactgc 


ctcgccgcca 


aagagcaggt 


agcgcagggt 


gctgaaggcc 


tggggttgct 


cccgggccac 


gtggttgaag 


agggcggtgg 


tgacgaagag 


cacggagatc 


cgctgctcgc 


gcaggacctc 


ggccagccgg 


cgcgcctgaa 


tggcctcctc 


cgtggcgagc 


cccaccagcc 


gtgccccatt 


gagcagcgcg 


ccccagacct 


cgaaggtggc 


cgcgtcaaag 


gaggcattgg 


aagcctgggc 


gacccgatcc 


tctggcccga 


gctgaacgta 


gtccgtgttg 


cacaccaggc 


gggtcatggc 


ccggtggcac 


acggccacgc 


ccttgggctg 


gccggtggag 


ccggacgtat 


agatgacata 


cgccagatct 


tctccgctgc 


actggggcag 


gggctcgcca 


ccgtcggagg 


gagcggggtg 


ccatgcctgg 


agatccagcg 


tggcgcagcc 


ctggggagcg 


aagggcagcg 


ggccgctgtg 


cacgagcaac 


cgcgcgccgc 


tgtcctggag 


catgaactcc 


tggcgctgta 


cggggtagga 


aggatccagc 


ggcacgtagg 


cggccccggc 


ttggagtacg 


gccagcatgg 


actccacctg 


ctccggcgag 


cgctccagcc 


gaagggcgac 


caggtctcct 


cgccgcacgc 


ctcgggccac 


cagaccctgt 


gcgagcccgc 


gggcccgctc 


cgccaactcg 


ccgtaggtga 


ggctgcgctc 


gccatacacc 


actgccacgg 


cccgcggggc 


ctggagggca 


cgctgctgaa 


agaggccgtg 
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aacgctggcc tctcgcggat aggggcgctg tgtgcggttc cacccggaga gcacctggtg 41174 
gcgctcctgc tctcccagca ccggcagctc cggcacgggc cgatccggcg cctccaccac 41234 
cgcctccagc acctggcgga agccctccag gagccgcctt gccgtgggct cctcatacag 41294 
atccacggcg tactccaacc gcccggtgag cccctccccg gtgtcataca cccacaggtg 41354 
gagatccagg tccagcaccc cccggggaaa gtcgagcggc tgcgcttgca ccccctccag 41414 
cttcagcccg tggccgggca tccatggcgt gtcgctcacc gcgagcgcca cgcgatagag 41474 
cgagctgtcc ccgcgctcgc tgcctggctg gagcgcctcc accaggtgct cgaagggcaa 41534 
ctcctgatgg gcataggcct ccacacacac cctgcgtgcc tgacgcagca actcgcggaa 41594 
gcgggggcgg cctctcaagt cgagccgcag caggacgatg ttgaggagga agccgaggat 41654 
gccctccaag cccgggcggt tgcggttggc ggcggcgctg cccaccgcca tatcgtgctg 41714 
gcccgtccac cgttgcagca gcacgcccag ggcggccagg agcgtcatgt agagggtggc 41774 
gccctcgcct tgccccagcg cgcgcagcac ctgggcttgc ggagcggaga gcaggggcgc 41834 
ccgtacctcc gcgcctctca tggaggggat acgcgggcga ggatgatctc ctggtacctc 41894 
caggggcgtg agcccggcga gctgtcgccg ccagtaagcg agctggacct ggtactcctc 41954 
tccctccacc caccggcgct gccagtcggc aaggtccgcc acctgcaggg gcaggggagg 42014 
tagccgcggc tcggccccag cgctcagcgc gccgtagagc gccgccagct cctgttcgag 42 074 
cacctgcagc ccgactccat ccgccgcgat gtggtggagg ctgagcacca gcacgtgctg 42134 
ctgctcggcc aggcgcagca ggcgcacgcg caggtggggc tccgtctcca gggcgaagcg 42194 
gcgccgggac tcggccaggg ccacctctcg ggcctgctcc agccgctcgc ccacggtggc 42254 
acccggtagc gcttccacct ccagcgccgg ctgccacggc tcgccgatga gcgaataggg 42314 
gactccgccc tcctcgccca gggtggtgcg cagcgcttcg tgtcgcctca ccagcgcagc 42374 
caccgccccg agcagcgcgg ccacgtgcag cgggccggtg aacaccacgg cccccgtgag 42434 
gttgttggcc ggggcctcag ggtggagctg ctcctggaac cacagccgct gctgggcgaa 42494 
ggagagcgcc agccgcccgt ttcgagccac tggggccagg aggctgcctc cctgagagga 42554 
gggctggagc tgccgtagca gtgcgatgag ctcatggcga gcgcctccca ggcgggactg 42614 
-sgctcgggc gtcatcaccc ctgggggagc ctggaagcgc agtcgctcgc cctctaccca 42674 
caggcgtaca cccaggctcg cggcgtgggc gagcaacgct ccggcgctca aagttctccc 42734 
tcctcgatgg aggcagaggg agggtgggga gcctgggtgc tccagctctg agcctcgaga 42794 
""----"3. tggccgccag ggtgggctgc tcgaagagca cccgcagggg cagctccagc 42854 

......, A ., ; -,^ C cgcggafcgag cgacaagagc C gtgtggcca gaagcgagtg gccccccagc 42914 

tcgaagaagt tgtcctccac ccccacgcga tgcacgccga gcacgcgccg ccacagctcg 42974 
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gcaagccggc gctcggtggg ggtccgcggc gcgaggtggc ctggcgtttg ctcggcttgt 43034 

gcgggctcgg ggggaggcag tgccgccacc tccaacttgc cggtgggctt gagcggcagc 43094 
gcgtccagca gcacgaagga agagggcacc atgtactcgg gcagcttgtg ctcgaggtgg 43154 
cttcgcagca ctcccacgag ggtgcgccgg cggctggccg ccaggggctg gttggcgtag 43214 
ctcttccagg gtcgaggggg aggcggggga ggcgtcagcc aggccgccgg gcgcgaggag 43274 
ccagccgcgc gggccatgag cagcgcgtcg aaacggcctt cctcgtcctg ggcgctccag 43334 
gtgacgtggc acgtgtagtc ggctgcctcc gccagcctcc agagctgctc tggattccag 43394 
gatgtcaccg ccggcagtgc ccgcagccgg cccacgctgt tgccagtccg cagcagcgcc 43454 
agccgtgccg cctcgtccgc cgtgcgggcg ttggccacat tgcgcagcac cagccccgcc 43514 
ggtcgctccg ccagcatgcg ctccagcctt tccaggccga gcccctcatg cctccagtcc 43574 
agctctggca gctcttgcgc ggggcctgct ttcgctacgg gggcgagctg cagctccacg 43 634 
tcgtagcgga agcggttcat ctcgttcctc ccgctcccgc gcttgaggtg cagccggacc 43 694 
gcgcccagct ggggaaagtg cgtggcgagc gcggcgaaga agtccgggtc cacgtacaac 43754 
cgttcgtcca gcagcatgcg ctggcgggtg cgggcgagca gcgctggcgc ctccaggtcc 43814 
gctgacgcct gctccagggc caccgaggca tggaacagct cgaagagccg cagattctgc 43874 
acgtcgccaa tgaacagggt tccgcccggt tggagcacgc gcagcgcgcc ctccacgacc 43934 
cgcagcaggt agtccacgct ggggaagagc tgcgtcaccg agttgaggat gacggtgtcg 43994 
aagccggctt ccggaagccc ggagaagtca tccgccggct gggccatgag ccgcacgcgc 44054 , 
tgcgcgagcg gggagccgcc cgtctggagt tgctccctca gccgctcgac ggctaccggg 44114 
gagagctccg tgccccagta ggactcgcag cggggggcga gccggcggag caggagcccg 44174 
gagccacagc ccagctccag cacccgccgg ggaacgagct ccatgagccg ctccactgtc 44234 
gtctccaccc actctcgcat ctgctccgga ggcagcggct cgccgcgcac gctgtcattc 44294 
cacccggcca ggtcaaaggc cgtgccgctt gggccgccat aggtctcatc ccacaccgcg 44354 
cgccagcgct ccacgtgctc tgtctccgcc ccgtccgtgt cctcgagccg tgcggccacg 44414 
taggccacca ggcgccgctc tccggcgctg tccacgtagg gccgcacgtg cgcttggtgc 44474 
acggcggggt gctcgcacag caccacctcg atctccccag gctcgatgcg gtagccccgg 44534 
atcttcacct gcgcgtccat ccgttgcagg aactccagcg ctccgtcctc tcggaggcgc 44594 
gccaggtccc cggtgcggta gagccgggct ccaggctcgc gggagaatgg atcgggcacg 44654 
aagcgatcag ccgtctgctc cggacgtccc aggtagccgc gcgccagggc ggggccgccc 44714 
aggaacacct ctccggccac acccggaggc acgagctgct gctccccgtc cagcaacagc 44774 
acccggaggt gggccagtgg ccggcctacg ggcggcagcc gtggccaacg gctcggctcg 44834 
cgtgtcaggt catggctcgt caccacgtgg cactcggagg ggccgtactg gttgtggagg 44894 
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gtgcactccg 


gcatccgcga 


gagcagccgc 


tgeaactegg 


gggtcgcctg 


gagttgetec 


44954 


ccggcggtaa 


tgagctggcg caggtggcga 


ggggccgcgc 


cgcgctcatc 


cgccacccgc 


45014 


gcgagctgct 


gcagggcgat 


gaagggcagg 


tacagccggc 


tcacctgctc 


ctgctccagc 


45074 


acttccagca 


gccgggcagg 


atccctgcgc 


acctcttccg 


tcaccatgac 


cacggtgccc 


45134 


ccagcggccc 


aggtggtgaa 


gagttcctgg 


tagcagatgt 


cgaagttgag 


egaggegaac 


45194 


tggagcgtgc 


gtgaggggcc 


acccagccgc 


tcatgggtga 


gctgccaggc 


caccaggttg 


45254 


gccagacagc 


gatgcaccat 


cgcgatgccc 


ttgggcgtgc 


ccgtggagcc 


agaggtgtgg 


45314 


atgacgtagg 


ccaggttgtc 


gagcccggcc 


tcgtgtggcg 


catcctcctg 


gggctcgcgc 


45374 


tccagtgcct 


ccgcctccct 


ctccagacac 


aggagegegg 


cgttcgtggg 


gggcaggege 


45434 


tggagcaggt 


gctcctgggt 


gaggaggatg 


cggcagctcg 


catctcccag 


gaggaaggcg 


45494 


agccgctcgc 


ggggctgatt 


gacgtccagc 


ggaaggaagc 


cggccccggc 


cttccatatg 


45554 


gccagcagtg 


ccaccggcag 


ctccagggag 


cgctccacgc 


acagccctac 


cagggtgtcc 


45614 


gggcccgcgc 


ccaggcgccg 


cagccggtgg 


gcgagcgggt 


tggcgcggcg 


gttgagctgc 


45674 


tcgtaggaga 


gcacctgcgg 


gccgaagcac 


aacgeggggt 


gatggggggt 


gagccgcacc 


45734 


tggtgctcga 


agcggtggtg 


ggccagcacc 


tgcgcgccct 


ccacgggccc 


gcccaggccg 


45794 


tgccccgcca 


gcaaagcctc 


gt get caeca 


ggcgtgagga 


gcgccagccg 


cgagagcctc 


45854 


tcgtgaggcc 


gctccacgag 


ctgctccagc 


aggtgcaggt 


actgagtggc 


ggcgcgctgc 


45914 


atgcgcgcct 


cgctgaacag gtccgcgctg tagtccagcg ccccctccag 


cccctcaccg 


45974 


cgcggctcca 


gcagcagagt 


gagatcatac 


gccaccaggc 


cggtgaagga 


ctcccagggc 


46034 


cggaaggcgc 


agccgccagc 


ggtaggctgg 


ggctgctccg 


gcagcaggtc 


gaacagcacc 


46094 


tggaacagcg 


ggtggcggct 


ggggctgegg 


gcaatgtcta 


gcaggctcac 


cagccgctcg 


46154 


aaggggagcg 


ccgcatgggc 


gttggcctca 


cgcacctcct 


gatcgagctg 


ccgcagcgcc 


46214 


tcccgcaccc 


gggtgtgagg ctccagccgc 


gagegcaatg 


ccaccggggt 


gecgaagtat 


46274 


ccaatgaggc 


catgcagctc 


gggccggggg 


cgctgcgggg 


cgggccaccc 


caccaccaat 


46334 


tcctcctggc 


gagagtagcg 


gtgtagcagc 


gtggtgaagg 


cggtgagcag 


cagggcaaag 


46394 


agtgacacgc 


cctccttgtg 


cgccaggtcc 


tccagcgcct 


ggcgcaggcg 


cggctgcacg 


46454 


Q* tga t c^gc ca 


cccgccctgc 


ccggtacgtc 


tgtctgaggg 


gacgcgctcc 


ctccgtgggc 


46514 


agctccagca 


ccgtgggcac 


cccggcgagc 


cgctgccgcc 


accaggcctc 


caaggcctct 


46574 




cgaagcttgc 


ctcccagcca gcgtagtccc 


tgtattggac 


gggcagtggc 


46634 


gccagctccg ggctcgagcc 


ttccacgtgc 


tggcegtaga 


gcatcgccag 


ctctttgatg 


46694 


agca.cc.tcgc 


agctccaggc 


tccatccgag atcaggtgat 


gcagggtgac 


cagcagcaca 


46754 
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tgctccgttt ccgccagtcg cgccagccgg actcggaaca gtgggccttg ctgcaggttg 46814 

aagggcgccg cagcctccac gcgcgccagc tccgccaggg cccgagccct gtccgcctgg 46874 

gcgaccccgc tcagatcgct gagaggcagc tcgggccacc actcgggccg cacacgctgc 46934 

tcagggcccg ccgcgccaat ggcaaccacc agccggagcg cctcatggcg cttgaccatg 46994 

agccccaacg cccgccgcaa cccctctacc tgcaaaggcc cctgtatgtg atgactttgc 47054 

aacaaatgct gggccgcgtt gcctggctcg agctgctcca ggaaccacat gcgccgctgt 47114 

gaggatgaga ggggagcctt ttcgccgctg gcgtgccgca ccgcgggcag cgcctcgtcc 47174 

gacghggcga gtcccttggc ccggagggct ttcagcgcgt gctcggcagt cgctgtggct 47234 

gtcaattgca cccctcctgg aattgaacaa gggggggggc tgtgcccgat agccaatcca 47294 

tcgatttcat aaagcccccc acctctattg gcatacctac tgtccagcgt cataccaagt 47354 

caagccaccg tccagcttgc tccttctggc atgtcgctga tagtgatgag ttccacagac 47414 

gcaacaggtg acccatggag ggggagagcg cc atg gcg gcc cgt tgc egg cag 47467 

Met Ala Ala Arg Cys Arg Gin 
1665 1670 

gca cgc ggg gca tec atg aca aga att ctt ctt ttc tct ggc aag 47512 
Ala Arg Gly Ala Ser Met Thr Arg He Leu Leu Phe Ser Gly Lys 
1675 1680 1685 

ggt ggg gta ggc aag acg acg gca tec gcg gcc ace gcg gtg gcc 47557 
Gly Gly Val Gly Lys Thr Thr Ala Ser Ala Ala Thr Ala va! Sa 
1690 1695 1700 

get gcc aag cgc ggc tac cgc ace ttg gtg atg tec ttc gac att 47602 
Ala Ala Lys Arg Gly Tyr Arg Thr Leu Val Met Ser Phe Asp lie 
1705 1710 1715 

111 %V T CtC ga ° ag ° ttC gat Ctc gat cgc aag etc ttc 47647 

Ala His Ser Leu Ser Asp Ser Phe Asp Leu Asp Arg Lys Leu Phe 

1720 1725 1730 

gac ttc aac gag gga ctg ccc cag aag gtc gcc ccc aac ctt gaa 47692 
Asp Phe Asn Glu Gly Leu Pro Gin Lys Val Ala Pro Asn Leu Glu 
1735 1740 1745 

ctg caa gag ate gac ate cag cac gaa ctc cag egg cag tgg agt 47737 
Leu Gin Glu H e Asp He Gin His Glu Leu Gin Arg Gin TrJ Ser 
1750 1755 1760 

^ fr^ l at aaC taC atg tcc gtg ct 3 cfc S ac c tec ace ggg gta 47782 
Glu Val Tyr Asn Tyr Met Ser Val Leu Leu Thr Ser Thr Gly Val 
1765 1770 1775 

tcc aac atg gtg gcc gag gag gtc gcc att ctc ccg ggc acg gag 47827 
Ser Asn Met Val Ala Glu Glu Val Ala He Leu Pro Gly Thr Glu 
1780 1785 1790 

vll Se s2 r CtC S C ^ Ct9 ^ C f 9 taC gtg CSg Cag ggg 47872 
Asp val lie Ser Leu He Tyr Leu Asn Gin Tyr Val Gin Gin Gly 

1795 1800 1805 

cgc tat gac gtg ate ate gtg gac tgt ccg ccc acg ggc gag teg 47917 
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Arg Tyr Asp Val He lie Val Asp Cys Pro Pro Thr Gly Glu Ser 
1810 1815 1820 



etg cgc ttc gtc 
Leu Arg Phe Val 
1825 



aac ate acc age gtg ctg cag tgg tat ate cgc 
Asn He Thr Ser Val Leu Gin Trp Tyr He Arg 
1830 1835 



47962 



egg cgc ttc aat gtg gat cgc acg etc gtc aag ctg gec egg ccc 48007 

Arg Arg Phe Asn Val Asp Arg Thr Leu Val Lys Leu Ala Arg Pro 
1840 1845 1850 

ctg gec agt egg etc acg age tat gat ctg ccg gag gac age tac 48052 

Leu Ala Ser Arg Leu Thr Ser Tyr Asp Leu Pro Glu Asp Ser Tyr 
1855 1860 1865 



ttt gee tec etc gag cag etc ttc gat cgc gtc aag ggc att gaa 
Phe Ala Ser Leu Glu Gin Leu Phe Asp Arg Val Lys Gly He Glu 
1870 1875 1880 



48097 



ggg ttg etc acg gat ggc aat cgc acc acg gtg cga ttg gtc tec 
Gly Leu Leu Thr Asp Gly Asn Arg Thr Thr Val Arg Leu Val Ser 
1885 1890 1895 



48142 



age gcg gaa aaa atg gtc ate cgc gag aca cag egg gec tac etc 
Ser Ala Glu Lys Met Val He Arg Glu Thr Gin Arg Ala Tyr Leu 
1900 1905 1910 



48187 



tac ttc aac atg 
Tyr Phe Asn Met 
1915 



tat ggc atg acg gtg gac cag gtc gtg gtc aat 
Tyr Gly Met Thr Val Asp Gin Val Val Val Asn 
1920 1925 



48232 



cgt ctg ctg ccg gac acg gaa tac ttt get cag tgg cac aag tct 
Arg Leu Leu Pro Asp Thr Glu Tyr Phe Ala Gin Trp His Lys Ser 
1930 1935 1940 



48277 



caa tct gee tac ctg gat cac ate cag gag tac ttc tec ccc atg 
Gin Ser Ala Tyr Leu Asp His He Gin Glu Tyr Phe Ser Pro Met 
1945 1950 1955 



48322 



ccg gtg gee agg 
Pro Val Ala Arg 
1960 



etc ccg ctg ctg gag 
Leu Pro Leu Leu Glu 
1965 



cac gag gtg gtg ggg ctg 
His Glu Val Val Gly Leu 
1970 



48367 



gag egg ttg gag 
Glu Arg Leu Glu 
1975 



gat ctg gcg cag egg etc tac ggg gac teg gac 
Asp Leu Ala Gin Arg Leu Tyr Gly Asp Ser Asp 
1980 1985 



48412 



ccc acc gag cgc tac ate tec ggg cag ccc tat cag ttc gee aag 
Pro Thr Glu Arg Tyr He Ser Gly Gin Pro Tyr Gin Phe Ala Lys 
1990 1995 2000 



48457 



cag cac gag ggc cgc tat egg ctg cag 
Gin His Glu Gly Arg Tyr Arg Leu Gin 
2005 2010 



etc etc atg cca ggt gec 
Leu Leu Met Pro Gly Ala 
2015 



48502 



gac cgc gga gac att gtc ctg gat cgc cag ggc aat gac etc ate 
Asp Arg Gly Asp He Val Leu Asp Arg Gin Gly Asn Asp Leu He 
2020 2025 2030 



48547 



ate cgc gtg ggc ggc ttc cgc cgc cac gtc atg ctg ccg cgc 
He Arg Val Gly Gly phe Arg Arg His Val Met Leu Pro Arg 
2035 2040 2045 



teg 
Ser 



48592 



WO 2004/022586 



169 



PCT7EP2003/009780 



gtg gcg cag ctg gat gcc gtg gac gcg gtc ctg gac aag ggg atg 
Val Ala Gin Leu Asp Ala Val Asp Ala Val Leu Asp Lys Gly Met 
2050 2055 2060 



48637 



ttg aag ata gat ttt gcc egg ccc tga actggagee gtg aat gca gtg 
Leu Lys He Asp Phe Ala Arg Pro Val Asn Ala Val 

2065 2070 



48685 



agg cct gtc atg cag ctg aaa ggg gcg ggg tea gcc atg egg atg 
Arg Pro Val Met Gin Leu Lys Gly Ala Gly Ser Ala Met Arg Met 
2075 2080 2085 



48730 



aca agg ate ctg ggt gag cag gaa ctg cgc cag ate gtc cgt gca 
Thr Arg He Leu Gly Glu Gin Glu Leu Arg Gin He Val Arg Ala 
2090 2095 2100 



48775 



acg ggg ctg cac gcg ctg atg gat gag gtc ate tec 
Thr Gly Leu His Ala Leu Met Asp Glu Val He Ser 
2105 2110 2115 



acc etc gag 
Thr Leu Glu 



48820 



tac gag ctg gag cac ttc gac ccc gcc aac acc gag ctg cgc aag 
Tyr Glu Leu Glu His Phe Asp Pro Ala Asn Thr Glu Leu Arg Lys 
2120 2125 2130 



48865 



cgc gag ggc ttc cag tat cag cac ccc cac ccc ggc gtg ttg gag 
Arg Glu Gly Phe Gin Tyr Gin His Pro His Pro Gly Val Leu Glu 
2135 2140 2145 



48910 



tgg atg ccc gtc atg cag gtg 
Trp Met Pro Val Met Gin Val 
2150 2155 



9QQ cgc cac gcc gtg ate aag ctg 
Gly Arg His Ala Val He Lys Leu 
2160 



48955 



gtg ggc tac aac ccc cgc age ccg gag gag ctg ggc 
Val Gly Tyr Asn Pro Arg Ser Pro Glu Glu Leu Gly 
2165 2170 2175 



ttg ccc acc 
Leu Pro Thr 



49000 



att ctt tec tec etc agt ctg tat gat gtg acg acc ggg cac ctg 
He Leu Ser Ser Leu Ser Leu Tyr Asp Val Thr Thr Gly His Leu 
2180 2185 2190 



49045 



egg gcc gtt tgt gat ggc acc ttt cct acc gcg eta cgc acc ggg 
Arg Ala Val Cys Asp Gly Thr Phe Pro Thr Ala Leu Arg Thr Gly 
2195 2200 2205 



49090 



gcg gcc tec gcc gtg gcg agt egg ctg etc gcg egg 
Ala Ala Ser Ala Val Ala Ser Arg Leu Leu Ala Arg 
2210 2215 2220 



gag gac age 
Glu Asp Ser 



49135 



egg gta ttg ggc ctg gta ggg tgt ggc get cag gcc 
Arg Val Leu Gly Leu Val Gly Cys Gly Ala Gin Ala 
2225 2230 2235 



gtc acc cag 
Val Thr Gin 



49180 



gca cat gcg etc teg cgc gtc ttt cct ctg gga egg gtg ctg gtg 
Ala His Ala Leu Ser Arg Val phe Pro Leu Gly Arg Val Leu Val 
2240 2245 2250 



49225 



cac gac att get ccc gaa gcc 
His Asp He Ala Pro Glu Ala 
2255 2260 



atg cga age ttt gcc egg cgt gtg 
Met Arg Ser Phe Ala Arg Arg Val 
2265 



49270 



gcc ttc ctg ggc ctg gag gtg gtg gcg gtc ccc ctg acg cgc gtg 
Ala Phe Leu Gly Leu Glu Val Val Ala Val Pro Leu Thr Arg Val 
2270 2275 2280 



49315 
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gag cag gag gca gac ate etc tgc acg gec acc tec gtc gee gtg 
Glu Gin Glu Ala Asp lie Leu Cys Thr Ala Thr Ser Val Ala Val 
2285 2290 2295 



49360 



ggc gcg ggg ccg gtg gtg tec ggg cgt ctg ate aag gee cac etc 
Gly Ala Gly Pro Val Val Ser Gly Arg Leu He Lys Ala His Leu 
2300 2305 2310 



49405 



cat ate aac gee gtg ggc tea gat ctg ccc ggc aag 
His He Asn Ala Val Gly Ser Asp Leu Pro Gly Lys 
2315 2320 2325 



aca gag ctg 
Thr Glu Leu 



49450 



ccc ctg gag atg ctg cgt cgc gee ctg gta tgc ccg gac tat etc 
Pro Leu Glu Met Leu Arg Arg Ala Leu Val Cys Pro Asp Tyr Leu 
2330 2335 2340 



49495 



ccc cag gec gtg gtg gag ggg gag tgc cag cag etc 
Pro Gin Ala Val Val Glu Gly Glu Cys Gin Gin Leu 
2345 2350 2355 



gag cct gga 
Glu Pro Gly 



49540 



gag att ggc ccc age etc ttt gaa ctg acc cgc tct 
Glu He Gly Pro Ser Leu Phe Glu Leu Thr Arg Ser 
2360 2365 2370 



ccc gag cag 
Pro Glu Gin 



tgg cgc tct cac cgc gat ggc etc acc gtc ttc gac tec acc ggg 
Trp Arg Ser His Arg Asp Gly Leu Thr Val Phe Asp Ser Thr Gly 
2375 2380 2385 



49585 



49630 



ttt tec ttg gag gac aag gtg gtg ctg gaa gtc ate ctg gcg cac 
Phe Ser Leu Glu Asp Lys Val Val Leu Glu Val He Leu Ala His 
2390 2395 2400 



49675 



gee gag egg ctg ggg gta ggc cgc ctg etc cag gtg 
Ala Glu Arg Leu Gly Val Gly Arg Leu Leu Gin Val 
2405 2410 2415 



gag tec atg 
Glu Ser Met 



49720 



gee gag gac ccc aag gat ccg tat ggc ttt etc cag gag aca gcg 
Ala Glu Asp Pro Lys Asp Pro Tyr Gly Phe Leu Gin Glu Thr Ala 
2420 2425 2430 

acc tgc ctg aaa get gee cca gee gta taa egagecaagg tgttcga atg 
Thr Cys Leu Lys Ala Ala Pro Ala Val Met 
2435 2440 



49765 



49815 



aga ata etc acg gat gta ttt ctg egg aag tac cag cga gec cct 
Arg He Leu Thr Asp Val Phe Leu Arg Lys Tyr Gin Arg Ala Pro 
2445 2450 2455 



49860 



gga ccg cgc gag cgt tec ctg gee gcg gac gcg cgc tec aag ctg 

Cly Pro Arg Glu Arg Ser Leu Ala Ala Asp Ala Arg Ser Lys Leu 
-460 2465 2470 

tgg clrg tec gtt gga ctg acc gee ttc etc gtg cct gee tat att 

Trp Leu Ser Val Gly Leu Thr Ala Phe Leu Val Pro Ala Tyr He 
2475 2480 2485 



49905 



49950 



cj^u uao ccc tgc cgc atg ttg ggg ctg ccg aag ggc gag ctg ccc 49995 

■ - ^V- Phe Cys Arg Met Leu Gly Leu Pro Lys Gly Glu Leu Pro 
rj: "" 2495 2500 

r^cr aacr etc tat gac atg ctg cgc gag cgc aat gag gcg cca gac 50040 

Trp liys Leu Tyr Asp Met Leu Arg Glu Arg Asn Glu Ala Pro Asp 
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2505 2510 2515 

etc aag caa cgc ttt cag cag etc gag cac ctg ccc gag ggt tec 50085 
Leu Lys Gin Arg Phe Gin Gin Leu Glu His Leu Pro Glu Gly Ser 
2520 2525 2530 

ttt ggc cgc gag ttc tgg aag tac tgc cac tec cat ggc gtg ccg 50130 
Phe Gly Arg Glu Phe Trp Lys Tyr Cys His Ser His Gly Val Pro 
2535 2540 2545 

ctg ccg ggt cac cct cac gee ctg ccc gtg cac teg gec gtg cat 5 0175 

Leu Pro Gly His Pro His Ala Leu Pro Val His Ser Ala Val His 
2550 2555 2560 



gac ttc gtc cac ctg etc tec ggc tat ggc ate tec ate tgg gag 
Asp Phe Val His Leu Leu Ser Gly Tyr Gly lie Ser lie Trp Glu 
2565 2570 2575 



50220 



gaa atg ctg acc acg gee ttc age ctg ggc ttc atg ttc ggc ccc 502 65 

Glu Met Leu Thr Thr Ala Phe Ser Leu Gly Phe Met Phe Gly Pro 
2580 2585 2590 

cgc acc cgc aac tat gat ccg gee cag tac aag tec ttg gtg ccc 50310 
Arg Thr Arg Asn Tyr Asp Pro Ala Gin Tyr Lys Ser Leu Val Pro 
2595 2600 2605 

gtt cgc ctg etc gee gag gcg ctt ccc att ccc cag ctg egg cgt 503 55 

Val Arg Leu Leu Ala Glu Ala Leu Pro He Pro Gin Leu Arg Arg 
2610 2615 2620 

gcg gtg gag cga ggc age gcg atg age gtg gac etc ttc gga gac 50400 
Ala Val Glu Arg Gly Ser Ala Met Ser Val Asp Leu Phe Gly Asp 
2625 2630 2635 

tgg gat ccc tgg aag gtc atg gag ctg ccg ctg gaa gag gtc aag 50445 
Trp Asp Pro Trp Lys Val Met Glu Leu Pro Leu Glu Glu Val Lys 
2640 2645 2650 

egg atg tac aac ate cag ccg gag tag ccgctcagcg cgtcgcctcg 50492 
Arg Met Tyr Asn He Gin Pro Glu 
2655 2660 

gccagcagct cccaggtgcg egaggegage tgctccacct gctcgcgctg catgtgcgcg 50552 

gtgttgaaga gaaagctggc ctccaaccgc tgggcaaagg tgatcgccgt cagcaccacg 50612 

tcgaactggg ccgcggtcgt ccccgtgaag ccaaggccct egagegtcag gttcccatag 50672 

egggeggcaa tctccacacg ccccacgttg etcagggega gcatggagtg caggcccagc 5 0732 

cgtgtcgcgc gttccatcca gggctgggag gagatgtcca eggtgagege ggatatctcc 5 0792 

aggatcaggc ggaacagect caagtggctc cgcttcttcc gtgctgcctg cagaggegat 50852 

eggagtctgt gcgccagctc ccagaatggg gtgctgccgc gcgcgcggaa gaagtccgtc 5 0912 

ttccccgtgg tgaacaggee gaagtcctca cccacgccaa agcgctcccg agcatccaac 50972 

gcagaggtga gggtgagcgt cacccgcttc ttggcaccct gcgactcggc ccgagctatc 51032 

ageagegegg ccccgagcgc tgcgtggacg gaggaccctt gctctcgaca gcgcgcaagc 51092 

agccgcgcga getgeteggg ctccagaatt cggtggctgt agagegectg tccctcggcg 51152 
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ctgctgcgag ccatgccccg cggtgcgacg agccgctctg tgagcgtatt gagggtcgga 51212 
gacaagcgca gcagcgccga ggagccccgg cgcacggcgg cgccaagcag tcgggtgccc 51272 
ggtaccatcg gcccaatcag ctcctcatag gcgggccttg gggcgagctc gggcaggtgc 51332 
tggtcctggg aggccatgac ctgcagcaga tcgcgcaggg cgaagatgcc ggagtgcgcg 513 92 
tctccgatca gatgatcata gctgcacagc agctcggagc ggtgctcgcc ccgcacgagg 51452 
atgaagcgga cgagcggccc agggtccgag ttgaaaggcc tgtcaagctc tgcctcgaca 51512 
tgctgttgcc aggactcctc tgccttgcgt tccaccactt ccagcgcaat ggggggcaca 51572 
ccttgcgagg tgaagcgtgg cggagtaccc gcgacgatct tcgctcggag cagcgggtgt 51632 
cgccgttgca gggccaccaa cccctggcgc acagcctgct cctccacgcg tcctcgcagc 51692 
ttgcccgtaa tgacgatgtt gaaggggcgc ttctgccgta ccttatagaa gacctgctct 51752 
gtcgttccga gggagcgctc catatgatgg atagtaccag agcctgggtg cgcgaagcgc 51812 
taccatggag aagcccaaga agagcaagga gaggcatttc ccaggattcc tgaaatgaag 51872 
cccacgggcg tcaacggatt tcgaaagttg tgatcctcgc ctctgtttcg ttattcggtt 51932 
acgaaaggga ggtcgttcgc ttctgacgtg gccctggtct gttccggcca ctcctgaagg 51992 
ctcagtccct cgccattatc gacctcatct tccgccgcgt tgtggagcgc atggagcacg 52 052 
gccgggttca cgtcgcccca tgagaatcga actcaccagg gacgcctctc agcgcccctc 52112 
accggttttg aagccccgct cccccgatcc gcgaggtgcc cggagggatt gagggtcggc 52172 
cggtttccct ctggagcggg gtgggtggag cgtccatgga ggtccacccg cgtccgttca 52232 
tcttgggtac agcttgggta cgagggtggg gacggcaggg gtcaggggcc attggctgca 52292 
ccggggacgg tgacgtccgt gaactgggtg ccggagatcc cgcgcgctgt ctcgggcccc 52352 
aaagaatgcc tacaaaatcg ccttggttgc agctgcgtcc ttgagaagcc gggtccctat 52412 
gccagttttg ttaggcgctc gtatcaccgc tcgtatcacc gcatttaacc actgccttac 52472 
gattccaggt cgagcggcag attgagccta accgcgtcct cgacgagcgc ttcgaagtta 52532 
gaccgcctgg gctgggcctc caaccacctg tagaactcag ttttcaactg gggatcagtg 52592 
gattcggtga gcagcatctg ttgttcggcc ggattttcca caacgtgttc gaggacagcc 52 652 
tcgggcggca ggcccagcgc cgtcttcgcc acgtcttgat agctcccttg cgagcccagc 52712 
ttcgcattga atgtcttgcg gtcatggcgg gtggtgtctc tgttccatga agcctgcttc 52772 
gtggggatct tgatgtgctt cttcgcggcg acgtggacgt gccgctgctg cttctgagct 52832 
gggcgctcgg gatccacccg ataccagatg ccagaaatgt gcgagggtcg ccactcaccc 52 892 
tcatccagga gaagcgccgc atggcggagt tgctcctcag ggatctcgat caagagccaa 52952 
yydaoglicgg cggggacatc gcggtcatca tcggcttcaa acagttcgta tagcgtcatg 53 012 
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ggtcgggttc tccacgggtt ctaaacgaag gctacgcagc ctccgcctca ccctgcaata 53 072 
tcgttgttgt agacgctgaa tccagcgtcc agccgaggtc aatgctcacc ttcgcaacga 53132 
gaccaaacca acactattta gtcgtctggg cgtaggccgg atcgaatgcg accatgagct 53192 
tccagtgata gttgccctca tccagcggtg gcagtgggtg gtattcaggc cgaatttgct 53252 
tccgcatcca ctccaactgc ggagggcttg ataaatattg ccaaagcccc gaagcagatg 53312 
gcggcccctc gttgcgaagg cgcaaacgtg caatgcttga gatttcggca cccgtggcag 53372 
catcaaatac gggaacatga gggctgtcga agtatctaaa caggcaggag tcgtctgccg 53432 
tcacgatctc tccagttctt cggcccgtaa agttcatgat gttggtcagt attatggcat 53492 
ggatgtttct gcgcgctggc ggagtctcgg ggagtgaaac tccctgtgct tggctcctga 53 552 
actccttcca gtcagtttcg gcaatagaaa cacgccgtac caattggtgc tcggcgcgct 53612 
cgatctcctc cctcgccccg aactcatcca caggagttgt tgtcatcttc aggcacttaa 53672 
gctcgaagat aaataggtgc ccttgccaat aaagcaccaa gtcgtattcg atctgccgcc 53732 
catctgaagc aggaaatttg atgccccgca cagcgactgc ttgtgttaat tcctcaagcc 53792 
tcgattgaat catctgctca taatactctc cccgggagtc aaacgcggag cttgagcgag 53852 
ctgaagctaa atgctctatt gtgcgaatag gatgggaatt ggcaatgatc gaaggaacga 53912 
acaacaccct atggtcatct actggtacga gcggctggtc ccagatctct aggttttttc 53972 
tctgtgcgtc gaagacgatg ttgttgatat gggaaaagca gaattcgaat tcgatgcccg 54032 
acgcctcgct gattaccttg cataggcgct gcttggtgac tgcaaaaaca atgttgtgta 54092 
ttgctgactt ggtcttgaat gttgcgtgat tcctttgcag cagagccatg gcgagttcat 54152 
gcagcgtgaa gttcaacttc aactccttct gtgacttctc cagttcagga tctgctggcg 54212 
ctcccattac tggggttagg aatcgacgcg acatgtagat cttgcgagcc gtggcatatc 54272 
gatcaagacc gaggagatct cgcagggctt taatatcaat cttaaggtcc catgttttcc 54332 
ctggcttcgg gagattgaag ctcctcttga ggcgttccag gacttctctt ctactctcca 543 92 
gttctgcccc aggaccgagt ccatcagaca gcattgcttg ctgcaaaaat aggcgctcgc 54452 
gaatgtttcc gatttcgctt tggaggtatt gcgccttgtt ttttggtgtc cagatgaagc 54512 
cgcgtggatc atgctcaggt cgccagtcgg tatactggat ttggttccat agcaggctga 54572 
atcgatgcca cgcgtgtgcg caagcgagga gcctgccaaa gtcaggatcg tcataagggg 54632 
tctggggggt aaacttttca tttgggggtt gggtgtcgat cctgtgcatg gagatgcgga 54692 
gtattagttc ggcggcttcg gcggtagatt caaaaacaag atcgacactt gtcgttgcgg 54752 
tcccatcggt tcctggaacc gcgattgaag ggctcgccat gacaaatggc gagcccggtg 54812 
gggcccgctt gagcttcttt cgaatcgtat tgctaacaaa tatgtattgg gattcaacgt 54872 
acgtaaatag gcgccacatc attgtgaggc gcgaagattt ccagaattct gatttgtcgg 54932 
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actcttgcga 


tacgatctgc 


atggcccttg 


cgatgctttc 


gtatccgcgt 


agcgtgtgct 


54992 


gctcttctgg 


cgcaagatct 


tctgccaatc 


tgtttattgc 


ggatactttt 


tcgaagaatg 


55052 


ttggattcct 


aggatctatg 


gggcccagtg 


cgagggcgat 


caaattcagc 


cactgagatg 


55112 


gattctccag 


tctgatggac 


tggatgagtg 


cttgaatctt 


ttctgagctt 


ccttcgttta 


55172 


gggtggctcg 


cagcttcttg 


attattttct 


tcgtttgctc 


ggctgtagac 


atggggcttg 


55232 


gtttggatgg 


tgtaaggatg 


tagttattgg 


ttttcaggtg 


tttgaacggc 


ccatgacacg 


55292 


cacgggcgcg 


gacttcatgc 


tatgtgggtg 


ctagcttgat 


agtgaggatg 


ttctcaacat 


55352 


tatgggaatg 


tcaccccgga 


tagggtgatc 


gtgcgcagcc 


cgttcgattc 


ccacagtttg 


55412 


agggtgaacg 


tgtcctgggc 


cgcgtccggc 


tcagcctctg 


cttccaccac 


gaggcgccgg. 


55472 


atccgggcat 


tggcaacggg 


cgcttcctgc 


catacaggga 


gcatgctcag 


cgtttcacct 


55532 


gtcttgctca 


ccaacgatgc 


tccctgcgcg 


atccagcgtt 


gcgcggcttc 


tgggtagtcc 


55592 


aaatccatga 


gcacggacac 


gcggttggcg 


gagcggtagc 


tccatgcacg 


aagaatcctg 


55652 


atcgcgtctc 


caggacgctg 


gcggacttcc 


ttctcgaagt 


cgaggggctt 


gaccgaaacg 


55712 


cccgacttgc 


cgatcacgcg 


gttcgcaagg 


aggccagcga 


ggccgcccgg 


tcccgcatac 


55772 


tcggcgcgca 


ggcgctcgtt 


ctcttcgtga 


cacctttggg 


cctccgctcg 


ggcttctttc 


55832 


gattcctgtt 


gataggactc 


caccgtgcgc 


tggttgcgcc 


acacctcgac 


caagcgctcc 


55892 


gcctgagcgg 


ggtgcgcgac 


gaggatgaat 


gccatgccta 


ccggcgctgc 


tccgtccagg 


55952 


aaccgcaccg 


tcactcgcag 


gcgttcgccg 


ggcaggactt 


gctccgacgg 


caccagccgg 


56012 


agagtgcttc 


gtccaatgtc 


cacgagcgag 


aagcggtcgg 


cgccttcaac 


cgcgaccttc 


56072 


tcctgaagga 


gggcggagtt 


gaactcgaac 


gtggtggaaa 


cccccgggct 


tatctgtacc 


56132 


tccggtgtcg 


tcattgcctc 


gaagttctcg 


gccgagaact 


caatccggcg 


tacccccaag 


56192 


cccgtggctg 


ctggacgcga 


ttgcgcggcg 


gctgccccgg 


agaaaagcgc 


tatcaccagg 


56252 


agaacaacga 


tgggtgacat 


gggcatgcgc 


ggagaccttg 


aagtagcaga 


gtggaaccct 


56312 


gcattgatac 


ttggttcagc 


cccatgttgc 


tcctggtcgg 


ggttcactcg 


aagcggcgga 


56372 


cggccttcac 


gtcaaaatag 


ggatagaccg 


aggcgctgtt 


ggagccgtcg 


ttgatctcct 


56432 


tcttgaaccc 


aatagagtcg 


ttctcaatga 


tctgcatgca 


tacagggaag 


cggtcaccct 


56492 


ttggcgtgcg 


tgcttcagtt 


agtcggccgt 


aaacgcgatc 


ctctccgatg 


atgaggcgtc 


56552 


cggagag cgu 
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ggcccgcgcg 


caccgtgatg 


ttcctgctaa 


ccccctcctc 


ggcggtcgcg 


aagaaggtgg 


56672 


ctaccacatc 


ctcgtcatac 


tcaattcgaa 


gcctgctcat 


ggcctcgaca 


gcacctggcg 


56732 


ggcacgcctc 


tggcggaggc 


gtcgggcgga 


cctgggggcc 


agggcaggcc 


atcgctaggc 


56792 
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agcccgcgac 


accgaggccg 


agcgccttgc 


cgggctcgga 


gaatacgccg 


gtgttcttct 


tcgtcgcggg 


gagggcgacg 


gccgctaggg 


cctgagcttc 


cggtgccatc 


catggcgccg 


ggtcgaactc 


gggggatgca 


ggtggtgagt 


cgtgacgtgc 


ggcaggagca 


ggggaaggcg 


cgctcaagac 


caccagcaga 


cccatccagg 


cttgtgccgg 


gggcggcgag 


ggcatggggc 


ctgcctggat 


ggccaccgcg 


tttgagtccg 


gtggcccccg 


acgaggccgg 


gccttgcgct 


ccgcgtgcgg 


caccgccgcg 


agcgtggtgg 


tcacgtccca 


ggactcgtcc 


gcctgggcca 


cggcgtccgg 


gtggcgctcc 


tcgggctcct 


gggcctcggg 


gacgcgtgga 


ttgagcacat 


cctccacccc 


tgcggcctcg 


tccgggagga 


gaacgacgcc 


cagcgagtac 


aggtcatccg 


cgtccccgtg 


ctcctgctgg 


aagcgccagg 


gtggcagcac 


gccgccggtg 


atgctggggg 


ccaccactgc 


ctctccgtcg 


gaggtgcgct 


gcaccacctt 


cgcccggtgc 


gcggcgccca 


cgacttctcg 


ggccgagggg 


ttctcctctc 


cgtagtccat 


gacgatgaag 


aaggattgag 


ggatgcgcac 


cagattggag 


tgcttgagcc 


actcagccag 


cccccagagt 


gggatgagct 


cgcggtacac 


ggcaccgaag 


cccccggctc 


tgtccgtgcc 


cggggggacg 


acgggatgca 


ggggcgtgtg 


gggtcggcgt 


tcctggacat 


ttccccccgc 


gaagtggcgc 


gggagaaggg 


tggtgagctg 


tatgctcgga 


gtgccgccca 


ctcatgtacg 


cgaagctctc 


ccagcagatc 


gcgggcctgt 


cgcaggccca 


ggtggccgag 


cgcctggagc 


gtggccggtt 


gctgcccagc 
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caacggt tec 


crc c c c crcraacf 


cactaccrgct 


56852 


tcatcttcac 


crcraacrc cttc 


tecteggaga 


56912 
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getqeggett 
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ccacttccta 
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57272 
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cccraacrfccca 
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crefcefc ccfcfcc 


acrcrfc c t c cat 


57932 
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cacaccacac 


acctfcgagca 


57992 


tggcccatac 


gtccaaccgg 


cgcccttcca 
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58532 


ggaaccaatg 


tccgcgccgc 


ccggcatcgg 


58592 


gccatccacg 


tgccgacgct 


ggtgttcagc 


58652 


ctccccacgc 


tggtgggcct 


gtgcggtgtc 


58712 
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ctccgtgtcc 


ccgtggactt 


catcctgggc 


ggcctggccc 


tggaggttcc 


cgcccccgat 


gggtccggcc 


aataggggcg 


ccgcgtgacg 


gggccctgct 


cgtggatcga 


catggccgcc 


gagatgcggc 


aggaaggggc 


gtccccgagg 


ettgggcett 


cctgcccgcg 


ccaaacctgg 


ttctcggtgt 


cctggtaccc 


egaegcatga 


atgaagagct 


ggcgacccgt 


attggaagtg 


ccgcccgtga 


ggcccggacg 


cagctcgggc 


tcacgcaggc 


ggaggtggcc 


gagaagctgg 


gcctcgcgca 


catggtctac 


agccgcctgg 


agegegggaa 


gatgctgccc 


agegtccaga 


cgctgctgcg 


gatgtgctcg gtgetgegea 


tctcctcgga 


cgagttgctg 


gggatcgegg 


acgcggagga 


ggggacgaag 


tegggcaagg 


gggagegagg 


geagggeggg 


gcgcccaagc 


tgcgccagct 


caccggactg 


gegegcaaga 


tggacgagga 


ccagctcgat 


gccctcgtga 


aggtggccca 


ggtgttgttg 


cgctgaaacg 


acgcgggccc 


acaccggggc 


ccgcttgctt 


caaggaatgc 


accgccgccc 


gttctcactc 


ttcgttcggt 


gegggegteg 


gctcgtcggc 


gtccaggccc 


teggggaaga 


tgccttcttc 


gtccagttcc 


acccggtcgc 


geaggaegga 


cgcctccgac 


agcatccgcg 


ccagctcctc 


gctcgcctcc 


ttccggccga 


agggctgtga 


ctccgccgcg 


agcgccgctc 


gcagcgcatc 


gcgcatctcg 


gcggccgtgg 


cgtagcgctc 


ggagagcttt 


cgctggaggg 


ccttgtggac 


gatggccttg 


agcgcgtccg 


gcaggcccgc 


catcgcgttc 
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eggggeggta 


gcggttgacg 


agegegatea 


tetgegtgag 


gggcaccgag 


ggcgtctcct 


ccgtcttcac 


gtccacggtg 


gcgttggggg 


ccttttcgtc 


ctccacgtcg 


aagaggtgct 


tgcacgtcag 


catctccatg 


aggacgaggc 


ecagegagaa 


gatgtcggac 


cggccatcca 


tgggcttacg 


gttcaggtac 


tegggegagg 


cgtacgccac 


gtcgcccttg 


agcaggaggc 


ccggcgtctc 


ctccctgccc 


accatgagcg 


agtaggtagc 


gccgaagtcc 


gtcaccttca 


cctcgccgct 


gcgggccacg 


cggatgttcc 


ggggcgccac 


gtcgcggtgg 


atgatgecca 


gcggccggtt 


ctccgaatct 


ctcagcgtgt 


gcgcgtggtg 
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gccacctcgg 


cggcgatgta 


gagggegaag 


ggegeggaga 


gcggcttctc 


gcgcatggtg 


gcgaggctga 


tgatggtgtc 


cagcgtgggg 


ccgtccacgt 


acteggegat 


gacgtgcggc 


eggteggcat 


ggattttcag 


gtgatggacc 


tgggcgatgg 


ccgggtggtg 
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gagagctgga 


cctcctcgat 


cagccgctgg 


cagcgctcga 


aagtgacggg 


autgcgcagg 


cgcttgatgg 


tgacgagccc 


ggegagtcca 


tgggggaggt 


gacgctcggc 


en crcracrcacci 


acctcgccgt 


taccccgccg 




eggagggtet 


cgtacgtggt 


rfgric- ucucitg 


gtgaaccgga 


gtcgggggcg 


ggcgcccttc 


aggteggagg tgttctccgg 




tegatctgeg tttcgtcggt 


cggcgtgctg 
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ctcggctgag ggggacgggg ccggtggctc tcgcggtccg aggtctacta ttcataaacc 60632 

caaagggtat tacaagcgac cacagggctc ggcagggtga gcgcggcgga gcgcctgcct 60692 

ggtgctcgcg ctctgcctct gctccgtgcc ctgagtccca cctctaagca ctggccgcag 60752 

gagcttggcc ttcgcacgtt gaaggtgctg gcggaccacc tgggagtgtt gccgctggat 60812 

ctcctgacgt ttcccgagga tgacgagcgg caacgcctcg tggaccgcat ccgcacgatg 60872 

tcctcggccg agatcaagaa gctcttgaag gcgctcgcgg accaagacca gcggtagatg 60932 

gcgtccgaca tcggccagca ggcgcgaggc catcaatccc aggagaacgc cggtctcttc 60992 

gggggcaccg gctcggtgat ttcccctacg agcacacgac catcgtgatc accgggggag 61052 

ggtccggcaa cctggtcctg cggcaccacg cgcggctgta ccggctccaa tgctggcggg 61112 

ccggtcgcgg tcggcgaagg ggcttcggtt ccgtttgcca catggcgatc tccaatgccg 61172 

tggccgttgg ggtcgctgac ggttgccgcc cggcaaccac cagggaccaa cgcgctcctt 61232 

tcgagcatcg accggagcgt cgcgttttca tcccgaagcc gctgaagctc tgcgtggatc 61292 

tccttctctt ttcgctggcg gtgcagccgt agggtacgcg caggcagccc tggcgcggcc 61352 

gtggccgtga actgctccga ggtcagcccg gacctctccc aggccgcgag catcgcgtcc 61412 

aggtactccg tgctccagcg ggtgcctcgg ctcatacacg gaaggttgcc gtccggtgta 61472 

cgtcgcgaat cccctggccg tccacggcgc aggatcgcgc agcggcgttg atgcccgtgg 61532 

ccctcggggc gcaccgtccg aggtgcgcag acacattcga tatattcctt gtagccacac 61592 

cgaacccggc tccgctccag ggcgccgcaa cgacgtagcc agaatcgata caagttcaag 61652 

agccaccctg catttcaagg cgtcgaagca cggcttcggc gttgatggga gaggtgtacc 61712 

ca atg gga cat gag gac teg gac aag gca ctg ccc gac teg aag gcg 61759 
Met Gly His Glu Asp Ser Asp Lys Ala Leu Pro Asp Ser Lys Ala 
2665 2670 2675 

aac gag gtt gaa tec age cgc aac gag get gag tgg aac gag ctt 61804 
Asn Glu Val Glu Ser Ser Arg Asn Glu Ala Glu Trp Asn Glu Leu 
2680 2685 2690 

ccg ate teg gcg cga gac tgc gag gac ctg aag aag tec tgc etc 
Pro He Ser Ala Arg Asp Cys Glu Asp Leu Lys Lys Ser Cys Leu 
2695 ' 2700 2705 

ctg ccc teg acg gtg egg aaa etc ggt tgg cga acg gac acg gac 
Leu Pro Ser Thr Val Arg Lys Leu Gly Trp Arg Thr Asp Thr Asp 
2710 2715 2720 

gag aac aac egg aag ttg ttg aac tgg ccc gag gaa tgg aag ggg 
Glu Asn Asn Arg Lys Leu Leu Asn Trp Pro Glu Glu Trp Lys Gly 
2725 2730 2735 

gac get gga ggg ctg gtg etc gtg tac ccg ggg etc aac tac gee 61984 
Asp Ala Gly Gly Leu Val Leu Val Tyr Pro Gly Leu Asn Tyr Ala 
2740 2745 2750 

aag gtc aag ccg cat cgc ccg cgc gtg etc egg ccg aag gag gcg 62029 



61849 



61894 



61939 
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Lys Val Lys Pro His Arg Pro Arg Val Leu Arg Pro Lys Glu 
2755 2760 2765 
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Ala 



cag etc acc etc cca acc gcg acc ttc acg gac cac ate ctg cgc 
Gin Leu Thr Leu Pro Thr Ala Thr Phe Thr Asp His lie Leu Arg 
2770 2775 2780 



62074 



ccg tac acc cag gat gag gtc gaa ggg gag cca gcg aag gtg aag 
Pro Tyr Thr Gin Asp Glu Val Glu Gly Glu Pro Ala Lys Val Lys 
2785 2790 2795 



62119 



aag tac gag get cct ttc ggg tac gcg ccc cag ate ttc gac 
Lys Tyr Glu Ala Pro Phe Gly Tyr Ala Pro Gin lie Phe Asp 
2800 2805 2810 



acc 62164 
Thr 



gcg ate etc tgc ccg atg ctt caa gac gaa gaa gtc ccc atg 
Ala lie Leu Cys Pro Met Leu Gin Asp Glu Glu Val Pro Met 
2815 2820 2825 



gcg 62209 
Ala 



etc att gag ggc gag aag aag gcg gcg ctg etc gcg cag gaa ggc 
Leu lie Glu Gly Glu Lys Lys Ala Ala Leu Leu Ala Gin Glu Gly 
2830 2835 2840 



62254 



acc ccg gcg ctg ggg etc ccc ggc ate age atg gee cat gat gtt 
Thr Pro Ala Leu Gly Leu Pro Gly lie Ser Met Ala His Asp Val 
2845 2850 2855 



62299 



gag gee egg cga cga age tgg gat gac ctg acg gac gag tgg gta 
Glu Ala Arg Arg Arg Ser Trp Asp Asp Leu Thr Asp Glu Trp Val 
2860 2865 2870 



62344 



ccg cac ccc gag gee gca tea tac ctt gca cag ggc cgc acg gtg 
Pro His Pro Glu Ala Ala Ser Tyr Leu Ala Gin Gly Arg Thr Val 
2875 2880 2885 



62389 



ttc gtc tgc ggc gat gcg ccg gac atg gac ggc acg aac atg aac 
Phe Val Cys Gly Asp Ala Pro Asp Met Asp Gly Thr Asn Met Asn 
2890 2895 2900 



62434 



gtc gtt cgt gaa gtt gtg egg ate gcg aag ggc ctt tct gcg cgc 62479 

Val Val Arg Glu Val Val Arg lie Ala Lys Gly Leu Ser Ala Arg 
2905 2910 2915 

gga tgt gac cct cag att ccg tac gtg gac cac gac cca cag eac 62524 

Gly Cys Asp Pro Gin lie Pro Tyr Val Asp His Asp Pro Gin His 
2920 2925 2930 



gag aaa tec ggt gtg gat gat tgg gtg cat cgc gag atg aag aag 62569 

Glu Lys Ser Gly Val Asp Asp Trp Val His Arg Glu Met Lys Lys 
2935 2940 2945 

ttg ctt gee get cat cac cac aag acg gaa gca tgg get cgt cag 62614 

Leu Leu Ala Ala * His His His Lys Thr Glu Ala Trp Ala Arg Gin 
2950 2955 2960 

ttc ctg ate gtg cag tac ttg gee tct gcg cgc ccg gcg ctt cct 62659 

Phe Leu He Val Gin Tyr Leu Ala Ser Ala Arg Pro Ala Leu Pro 
2965 2970 2975 

ccg gca etc tgt gac tgg etc gtg gaa cag cat gag gcg cac ggt 62704 

Ser Ala Leu Cys Asp Trp Leu Val Glu Gin His Glu Ala His Gly 
2980 2985 2990 
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gtg aag gag ate gag ate aag cgc get etc gtc tgg tgc teg 
Gly Val Lys Glu lie Glu lie Lys Arg Ala Leu Val Trp Cys Ser 
2995 3000 3005 
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62749 



gtt tgg ttc tec ace gag cat ggc ate ttc aag aag tgg gtc ggg 
Val Trp Phe Ser Thr Glu His Gly lie Phe Lys Lys Trp Val Gly 
3010 3015 3020 



62794 



ace ttg aag agg aag aag gtc tgc aag ccc gag gtc ate gac cga 
Thr Leu Lys Arg Lys Lys Val Cys Lys Pro Glu Val He Asp Arg 
3025 3030 3035 



62839 



ttg gtg cag gcg ttg ccg agg gtc gag aac cgc ttc gag ccg cgc 
Leu Val Gin Ala Leu Pro Arg Val Glu Asn Arg Phe Glu Pro Arg 
3040 3045 3050 



62884 



aca tgg gtt caa cac tgg atg gag aag aac ggc gtc aca ttc gtg 
Thr Trp Val Gin His Trp Met Glu Lys Asn Gly Val Thr Phe Val 
3055 3060 3065 



62929 



tac gcg age gat ggc ttc cag att ggg ggc egg atg ttt cct gcg 
Tyr Ala Ser Asp Gly Phe Gin He Gly Gly Arg Met Phe Pro Ala 
3070 3075 3080 



62974 



tec gag ttc ttg tec cag ate gag ate gag tec acg aac gag cgc 
Ser Glu Phe Leu Ser Gin He Glu He Glu Ser Thr Asn Glu Arg 
3085 3090 3095 



63019 



aca tgc ate age gtg cga aat etc ggt aac gee gtg cgc ctg tgg 
Thr Cys He Ser Val Arg Asn Leu Gly Asn Ala Val Arg Leu Trp 
3100 3105 3110 



63064 



gag gcg gac caa cgc egg ggg ate gtc gag cag tat cgt gag aag 
Glu Ala Asp Gin Arg Arg Gly He Val Glu Gin Tyr Arg Glu Lys 
3115 3120 3125 



63109 



ttg aag ttc cgc gag gac gga cgc cga ggt gac gee gag gag ttc 
Leu Lys Phe Arg Glu Asp Gly Arg Arg Gly Asp Ala Glu Glu Phe 
3130 3135 3140 



63154 



att cga gca acc acg gga cgc ctt gac ccg gta gat etc gtg gtc 
He Arg Ala Thr Thr Gly Arg Leu Asp Pro Val Asp Leu Val Val 
3145 3150 3155 



63199 



gtt ctt cac ttc att tgg cag gtg aag egg aag ttg ttc ggg aag 
Val Leu His Phe He Trp Gin Val Lys Arg Lys Leu Phe Gly Lys 
3160 3165 3170 



63244 



ccg gtc aag gac cac ctg atg ccg ate ttc gtc ggc aaa cag egg 
Pro Val Lys Asp His Leu Met Pro lie Phe Val Gly Lys Gin Arg 
3175 3180 3185 



63289 



ggc ggg aag act acg gee ate aag age ctg eta cag cct ctt ggc 
Gly Gly Lys Thr Thr Ala He Lys Ser Leu Leu Gin Pro Leu Gly 
3190 3195 3200 



63334 



gat etc gtt ggg ttc ccc gee gat etc cag ttc ctg acc gat gag 
Asp Leu Val Gly Phe Pro Ala Asp Leu Gin Phe Leu Thr Asp Glu 
3205 3210 3215 



63379 



cgc cag cag ttc egg ttg act cgc tgc tac ggc atg ttc ttc gac 
Arg Gin Gin Phe Arg Leu Thr Arg Cys Tyr Gly Met Phe Phe Asp 
3220 3225 3230 



63424 
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gaa atg gcg aag gca gag egg gta gcg tct gac gtg etc aag aac 
Glu Met Ala Lys Ala Glu Arg Val Ala Ser Asp Val Leu Lys Asn 
3235 3240 3245 

cgc ate acc gec gag egg ate gag tac egg gtg etc ggt gtc cag 
Arg lie Thr Ala Glu Arg lie Glu Tyr Arg Val Leu Gly Val Gin 
3250 3255 3260 

aag etc cag acc ggc acc aac aac gee acc ttc ate ggt gcg tec 
Lys Leu Gin Thr Gly Thr Asn Asn Ala Thr Phe lie Gly Ala Ser 
3265 3270 3275 

aac aag cag gtg aag gag ate gtc gtc gat ccc acg gga atg cga 
Asn Lys Gin Val Lys Glu He Val Val Asp Pro Thr Gly Met Arg 
3280 3285 3290 

t QC °f a ° tg 9aC tgt Ctg ccc cag at 9 aac tgg acg gca 

Arg Phe Tyr Gin Leu Asp Cys Leu Pro Gin Met Asn Trp Thr Ala 
3295 3300 3 305 

ctg gag cag ate gac ttc ctg gee atg tgg acg age gtg gac gag 
Leu Glu Gin He Asp Phe Leu Ala Met Trp Thr Ser Val Asp Glu 
3310 3315 3320 

aag gcg gag age ccg etc etc ggc caa ttg gcg cag gtc cag gee 
Lys Ala Glu Ser Pro Leu Leu Gly Gin Leu Ala Gin Val Gin Ala 
3325 3330 3 335 

cgc cag gat gag att cga gee aag gac teg gtg gaa gat ttc ttc 
Arg Gin Asp Glu He Arg Ala Lys Asp Ser Val Glu Asp Phe Phe 
3340 3345 3350 

gag cgc egg tgc gcg cag egg gac cag tgg ate aag gee acg gcg 
Glu Arg Arg Cys Ala Gin Arg Asp Gin Trp He Lys Ala Thr Ala 
3355 3360 3365 

ctg tac ggc cac tac gtc gag gac ttg aag tac cag ggc cgt gac 
Leu Tyr Gly His Tyr Val Glu Asp Leu Lys Tyr Gin Gly Arg Asp 
3370 3375 338Q 

^ g 5 tg g ? g ^ C tCg CtC acg aag ttc ggt gaa cgc atg aag gag 
Arg Met Ala Phe Ser Leu Thr Lys Phe Gly Glu Arg Met Lys Glu 
3385 3390 3395 

rl?, a? C %V Sf 3 g ? g aa ° aCg ggc tgg aag cgg age gat gga ate 
Leu Ala Gly Glu Glu Asn Thr Gly Trp Lys Arg Ser Asp Gly He 
3400 3405 34 10 

f ag I a ° f gg 9tC aCg gtg aaa cgc tgg 2*9 ac <= gag ate gac ggg 
Lys Tyr Arg Val Thr Val Lys Arg Trp Glu Thr Glu He Asp Gly 
3415 3420 3425 

?, tg 5 tg g f 9 agC atC gCC gct ggc gct cg ST atg gac ctt gcg cac 
Val Met Glu Ser He Ala Ala Gly Ala Arg Met Asp Leu Ala His 
3430 3435 3440 

ccg tga ttggaacgga tgcgttccga ctgeggegat gccttccatg ctgtctgctt 



63469 



63514 



63559 



63604 



63649 



63694 



63739 



63784 



63829 



63874 



63919 



63964 



64009 



64054 



64110 



cggtctcccc ctggttgcag agagaggtgg aaggatcgga agggatggaa ggctatgggg 64170 
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aagtcaggac 


gatgacgccg 


cagcacgagt 


aaagggctga 


agtagccggt 


gctgtgaccc 


ttccgggcca 


tccatggcgt 


ccaacggggc 


tcaggtgccg 


ggccaagcgc 


ctcccgttcg 


ccttcaccgg 


cacggaggcg 


ggcagggcga 


gcgcccggaa 


ggacctcgga 


gagattcgat 


acattcagaa 


tagggacgga 


ctggtggcca 


ccggctcgat 


ccacaccttc 


gacaccatcc 


acggcctccg 


acggagaccg 


gttgagtaca 


agtccatgat 


cgagaacttc 


agctatatcc 


gtacaggctc 


cttgcttgct 


cggccggagc 


caccgttgga 


cctcattcct 


attcccattc 


cagtctttcg 


acctgccgtg 


cccattgagg 


accacgagcc 


cgcacccgtg 


gaatgcgcca 


tcctccagga 


actggccgag 


atcaagcgcg 


aggttcgtgc 


gcttcgcgct 


caatcggtca 


cacagaacgc 


ctcggctgtg 


atgctggagc 


gcgcgtcggc 


aatgctcggc 


tgcaagcggt 


cacaggtctt 


caagctcctg 


gagcaaggcc 


ggttggagcg 


ggcaccgaag 


atcggacggt 


ccgtgatgat 


cacggtggcg 


agcatcgagg 


cactgctcgc 


ggagggcgcc 


cctcggaagg 


gaggcaatcg 


gcctcaacaa 


accaagcagc 


ctgaacgagt 


ccgcaccaag 


gagcggcgtg 


ggaagcagga 


gggaaccgca 


gccggagcgt 


atccgaaccc 


aggtgttgcc 


atccgcaagc 


tcacgatttg 


agcctcgttg 


aaagtcgccg 


taacgaagcg 


gccgaagctc 


acgagcgagg 


gttttggtgc 


ccctcgacat 


ggcccaacgg 


gatagaggag 


gcagaacttc 


ttgtcctggc 


atccgcagcc 


ggatcgacac 


gctctaaccc 


cccgacatcg 


tgacaagaat 


gatccggcag 


tgatcgaggc 


gtagttcgcc 


gcatcctact 


cctcgatcaa 


ggaagagact 


ttgctggccg 


cgcccgaggg 


gccgatggtc 


tacttggcgc 


gcaggtgcag 


cgtgtctcct 


gaccactcca 


ggcgcgtgcg 


gtactgcgcc 


tccaactccg 


tgttgagcac 


cttcgtcaac 


tcttccaagt 


gctccctatc 


gcaggtccgg 


ttcgggtaca 


ggtccgccgc 


aacgatgttg 


atggcgtcga 


tcggaatgga 


cgcgggaata 


gtcccgacgg 


ggtgagtggg 


cgtggaaggg 


agcagcgagc 


gcagcacgcc 


tccgaactcc 


gcaaccgtga 


tgggggctct 


ggtcggcttg 


gctccggcaa 


ttgcggcgcg 


agtctccggg 


aaggccgcct 


cgacgaacgc 


gatcgcgacc 


ctccgtcgcg 


tttctagatc 


ccacacccca 


aagttgaagg 


gcgcctgcat 


gagggcgcgg 


ccatggagat 


cggttacgac 


tcctgcactg 


ccgttgagca 


tcgaggacac 


gcgcaggtgg 


aggtacccga 


caacgttgcc 


cgccacgtcg 


gtgatggtgc 


acttcccccc 


gaaactagga 


ggttcggtta 


cggccaggat 


ctgcatcgaa 


tggctccttt 


gtgggaggcg 


tgcggtaatc 


acctggagcg 


ttcggcgcaa 


taagaacgtt 


gaaggctccg 


gcctggggcc 


tgacgcatca 


acgaacctcc 


tcgtactgat 


ctccgaactc 


tctggttcat 


gagcctcacc 


ccgcaggtgc 


ccgagctatg 


cgcaactccg 


atggatcatt 


cgaatgctcc 


aggcggatgg 


ggacgacgta 


catcggcggc 


actgtggcgc 


tgacgtagtg 


gagggccgtg 


gtccgcgtac 


tgctgtggtt 


gagcgcctga 


gcgacggctt 


ccagggacag 


gccaccgccc 


ttcgggttgt 


agatgctgcc 


ctgctcgacc 



64230 
64290 
64350 
64410 
64470 
64530 
64590 
64650 
64710 
64770 
64830 
64890 
64950 
65010 
65070 
65130 
65190 
65250 
65310 
65370 
65430 
65490 
65550 
65610 
65670 
65730 
65790 
65850 
65910 
65970 
66030 
66090 
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agccacgttg cgaagctgtg ccggatcgcg ccgaagtgga cgaactccag gctcgtgcgc 



66150 



gcgcacacgc gctccacggc ctcgcggatg gtctgccgat ccggcgctga accccgggct 



66210 



tgaaggcgct gggcggcggc gagcccctgc acatcgagcg acaaggtgaa cggcttcccg 



66270 



gtcttgtgct tgaacgtgac ggtggcggcg atctcgccct ggtccttcag caccgcgatc 66330 

ttgccgtcac cgctcgcgag ccggtccacc tcggtgccgt gcatcccggt cttggcctga 66390 

aggcacagca catcgcggat ggcctgcgag ccgagggccg cgtagagctg ctggacgtgc 66450 

tcgatgctgt agcccttgag ttcgtggttc cggcgggcag gggggacctt caggaagcgt 66510 

ccggggttca ccatcgggtc cagttcgccc gagtcctgaa ggaacgaggt . gaaggacttg 66570 

aagacgatga tcctccactt cttgccggtg tcccaggtgg ccaggatctt cttcaacgcc 6663 0 

ggtccggtca ccgcgcgcag gtccttgtcc gcgagcgccg ttgcccacgc ggccaggtag 66690 

ctgcgcgtgc tcttcaggta ctcgtcggac ctgccgctgg acttcagatg ctggaggaac 66750 

cgccgggcac gatcctcgtc caggaacacc gcctgcgcgg cgtcctcctg ggcctgttgg 66810 

gcctcgtctt ggcgctcctg gctcgcggtc cggtacgcgg ccgggttgcg gcggaaggcc 66870 

gccagctcag ccagggcctc ggcctcgctc cgcacgtcca gcgtgacgtt gtacgcgacc 6693 0 

ccggcgacgg tcttgcggat gatccagcgc gtgcttccgt cggtggccgt gtacgtgcgg 66990 

cctccgatcc acttcccgct ccacttcgtc atgcgcggct cccgtacccc cctcacttgg 67050 

gtaaaacttg ggtacgagag gtacaggcgg gtggggcgtc ctgcaacccg atcagggccg 67110 

cgtcagaaaa ccccaaggaa aacatggcgg tagagcatga gaatcgaact caccagggac 67170 

gcctctcagc gcccctcacc ggttttgaag accgggccgg ccaccaggtc cggaggccct 6723 0 

accaccgtcg attactgccg caccccacgg gccccgtcaa ccaacgcccg gctcaccgtg 672 90 

tcttccgcgc cgccttcacg ggcttcaacc ccggcaatcc gttcaccatc gtggacacca 673 50 

acagccgcgc cagctcctcc ggcggcgagc ccttgaagcc cgtacacgtc agccgcagcg 67410 

ccacgatgcc gtgcaccccg gcccacagca cctcggccag cttcgagggc tcggcgtcct 6747 0 

cggcaatcct ccccgccgcc ttcaggtcct cgaacacctt caccagcacc ccgaaggacc 67530 

gcggccccgc gccgtcctcc ggcgcgtccc tgaacagcgc cgtcgacagc ttcgggtcct 67590 

ccaigaagat gagccggtac gtctccggct gctccagccc gaacttcacg taggcctcgc 67 650 

ccagccggta cagccgctcc accggctcct ccacggatgc cgccggctcg aacgcctcca 67710 

gcaggtcctg gaacccccgg atgcacagct cccgggcaat cgcgtcccgg ttctcgaagt 67770 

rcaggnagag cgtcgccggc gcgtactcca ccgcgtccgc cagcttgcgc atcgagaggg 67830 

caecaaagcc ctctcgcgtc accatgtcct tcgccacctg gaggatctgc tcgcgcagct 67 890 

ccy.'-vy- Lg gcgctccttc cgctccgaaa tccccatatc cccactgtag ggcttgacgt 67950 
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cctgaacggc cagcagtaga tgaacaccgt tcacaatgaa cgacgttcat aaaacgctcg 68010 

ccgttcggaa cccagggagc ggacc atg gat acg gac aag gtg gcg ttg ttt 680 62 

Met Asp Thr Asp Lys Val Ala Leu Phe 
3445 3450 



gga gcg teg ggc ccc ate gga cac age ate gtg aag get ctt cgc 
Gly Ala Ser Gly Pro lie Gly His Ser lie Val Lys Ala Leu Arg 
3455 3460 3465 



68107 



gag cag ggg agg gcg tac egg gtg gtt ggc cgc teg cgc gec teg 
Glu Gin Gly Arg Ala Tyr Arg Val Val Gly Arg Ser Arg Ala Ser 
3470 3475 3480 



68152 



ctt cag teg cag ttc ggc gcg gac ccg aag gcg gag ate gec acc 68197 

Leu Gin Ser Gin Phe Gly Ala Asp Pro Lys Ala Glu He Ala Thr 
3485 3490 3495 

tgg aac ccc gac gcg ccg gac acc ate cgc gec gec gec egg ggc 68242 

Trp Asn Pro Asp Ala Pro Asp Thr He Arg Ala Ala Ala Arg Gly 
3500 3505 3510 



ate cgc acg etc ate tac atg gtc ggc gtg gac tac tgg cgc ttc 
He Arg Thr Leu He Tyr Met Val Gly Val Asp Tyr Trp Arg Phe 
3515 3520 3525 



68287 



cag etc cac ccg gag etc atg cgc cgc acc gtg gac gee gee ate 
Gin Leu His Pro Glu Leu Met Arg Arg Thr Val Asp Ala Ala He 
3530 3535 ~ 3540 



68332 



gec gag ggc gtc gag cgc gtg gtg etc ate ggc acc gtg tac ccg 
Ala Glu Gly Val Glu Arg Val Val Leu He Gly Thr Val Tyr Pro 
3545 3550 3555 



68377 



tac ggc cgc ccc cgc acc acg ccc gtc acc gag gac cac ccc cgc 
Tyr Gly Arg Pro Arg Thr Thr Pro Val Thr Glu Asp His Pro Arg 
3560 3565 3570 



68422 



gag ccg cac acc ttc aag ggc cgc atg cgc aag gag cag gag gac 
Glu Pro His Thr Phe Lys Gly Arg Met Arg Lys Glu Gin Glu Asp 
3575 3580 3585 



68467 



ate etc etc gec gcg gac gec gee gga aag etc cgc gee acc ate 
He Leu Leu Ala Ala Asp Ala Ala Gly Lys Leu Arg Ala Thr He 
3590 3595 3600 



68512 



etc cgc etc ccg gac ttc tac ggc ccc ggc ate gac aag age ttc 
Leu Arg Leu Pro Asp Phe Tyr Gly Pro Gly He Asp Lys Ser Phe 
3605 3610 3615 



68557 



etc cac tec gec ttc gtc gec gee gee ' cag ggc aag cgc gee cag 
Leu His Ser Ala Phe Val Ala Ala Ala Gin Gly Lys Arg Ala Gin 
3620 3625 3630 



68602 



etc ate ggc ccc ate gac ccg ccg cac gag tac gtc tac gtg ccc 
Leu He Gly Pro He Asp Pro Pro His Glu Tyr Val Tyr Val Pro 
3635 3640 3645 



68647 



gac gtg ggt ccc acc gtg acg gee etc • atg gac gee ccc ggt gee 
Asp Val Gly Pro Thr Val Thr Ala Leu Met Asp Ala Pro Gly Ala 
3650 3655 3660 



68692 



tac ggc cgc tgg tgg aac etc gee ggc gee ggc gtc acc tea cag 



68737 
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Tyr Gly Arg Trp Trp Asn Leu Ala Gly Ala Gly Val Thr Ser Gin 

3665 3670 3675 

cgc aag etc gtc gag gag ate tac gec cag gec ggc cac etc gcg 

Arg Lys Leu Val Glu Glu He Tyr Ala Gin Ala Gly His Leu Ala 

3680 3685 3690 
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68782 



aag ttc atg acg gcg ggg aag ggg atg etc cgc etc ate ggc etc 
Lys Phe Met Thr Ala Gly Lys Gly Met Leu Arg Leu He Gly Leu 
3695 3700 3705 



68827 



ttc aac ccg ttc atg cgc gag atg gtc gag atg cac tac ctg etc 
Phe Asn Pro Phe Met Arg Glu Met Val Glu Met His Tyr Leu Leu 
3710 3715 3720 



68872 



acc gag ccc gtc etc atg gac gac teg gcg ctg cac aag ctg ctg 
Thr Glu Pro Val Leu Met Asp Asp Ser Ala Leu His Lys Leu Leu 
3725 3730 3735 



68917 



ggc age gtg cgc aag acg ccc tac age gag ggc ate cgc cag acg 
Gly Ser Val Arg Lys Thr Pro Tyr Ser Glu Gly He Arg Gin Thr 
3740 3745 3750 



68962 



ctg gaa gee acg egg gec teg etc gtc agt cca gee ccc get gec 
Leu Glu Ala Thr Arg Ala Ser Leu Val Ser Pro Ala Pro Ala Ala 
3755 3760 3765 



69007 



aca ggt cgt cct cat cca gee cca tga ggtgaccgac ctcgtgcatc 
Thr Gly Arg Pro His Pro Ala Pro 
3770 



69054 



aeggtgatte 


caatctgctc 


gatgagttcc 


tcgcgcgtcc 


tegegaageg 


ctccaggttg 


69114 


cgctggtaga 


gcacgatgga 


cgeggggage 


teatacgegt 


tcatcacgct 


ccgctcgccc 


69174 


acgggcgtgc 


cgcggaacac 


cccgaggatg 


cteggggaca 


gcggcgggtc 


ctgccccagc 


69234 


aaatcctcgt 


ccgagggcag 


gtcctccacg gcgatggtga 


cgttgtccag 


gtactgette 


69294 


gcgtggcggg 


gcagggactt 


caccgcgtcc 


tccaccgcgc 


ggtcgaactc 


cgcctcggcc 


69354 


agctccaccg 


gcggggggaa 


ctcctcgggc 


accagcgccc 


gcgccttgtc 


gaagcgccgc 


69414 


ttcgcctcct 


tctcgtcccc 


gcgccgctcg 


gegatgagee 


ccaggtagtg 


gtgcgcccac 


69474 


gggtcgtccg 


gcgcgtcctt 


cagcaccttc 


tegaaagegg 


ccttcgcctc 


gtcgaagegg 


69534 


cacagctcga 


acagegegat 


gccccgctcg 


acctgcgcgt 


ccagcgagcg 


cggcatgtgg 


69594 


ccgagcgccg 


cgtccaggct 


caccagcgcc 


gtggcgcact 


cgcccatctg 


gttgagcccc 


69654 


atgccctcca 


ggaggaggaa 


ctcgtacagc 


atctccacgt 


cgtcggcctt 


ctgggccagg 


69714 


cgccggcctc 


gcgcgcacag 


ggcgagcccc 


tccgccaccg 


cctcccggtc 


ctccccggcg 


69774 


cggcacacca 


ggcagtccgc 


ggcgctcagc 


aggatttcca 


ggtcctccgg 


cgccaccttc 


69834 


agggcctgcc 


egaaggcett 


cccggcctcc 


tccagccgcc 


ccaattccac 


cagggecgaa 


69894 


gcccggaagc 


ggagggcetc 


gggaagctcg 


ggcgcgtccg 


ccagcaggcc 


ctccgccccg 


69954 




cctcgaagtc 


cccggcctcg 


aaagcgtccg 


ccaccgcatc 


cagccgagcc 


70014 



WO 2004/022586 ^VCT/£P2003/009780 

~ 185 ^ 

tccacgccac ccaccccctc tcgcttcgcg gtccgcttcc ccatgggccg gcacatatga 7 0074 

aggcacgctg tgcgttgtca acgacgggca cggctgttag gttccggcgc ccgtccatgt 70134 

gtgacggccc gccccgtgaa catccttgtc gtcgatgacg atctcgagct gtgcaccatg 70194 

ctctctcgct tcctggagat gcatggctac acggtctact cggcggcgga tgccctgcag 70254 

gcgctcgaca tcctggagcg caaccaggtg ggcatggtca tcaccgatta cctcatgccc 70314 

cacctggacg gcatccactt cacggagatg ctcaaggccg acccccgctt ccaggccatt 70374 

cccgtgctgc tgatgacggg cgcccaggac gagggcgtca ccgaccgggg cctgcgcaag 70434 

ggcgtggcgc tgacgctgcg caaacccctg gacatgggtc agctgctcaa cctcgtgcgc 70494 

ttcgccgagt agccccccgg ccagacggcc ccgaggggcc ctggccccgt ccgctcgccc 70554 

acaggcacgc ccttcctggt tgacatcatt tcgctggccc ttatgttggc gctcgccatg 70614 

gccgagtgct aacggccagc gtcgtcacag ttaaatcccc cagtaatttc aggaggttgc 70674 



g atg gca gcg aag gag att ttc ttc cac cag tec gcg cgt gag gec 
Met Ala Ala Lys Glu lie Phe Phe His Gin Ser Ala Arg Glu Ala 
3775 3780 3785 

ate ctt cgc ggt gtc egg ace etc gcg gat gee gtg gcg gtg ace 
lie Leu Arg Gly Val Arg Thr Leu Ala Asp Ala Val Ala Val Thr 
3790 3795 3800 

ctg ggc ccc aag ggc cgc aac gtg gtc ate gag aag age ttt ggt 
Leu Gly Pro Lys Gly Arg Asn Val Val He Glu Lys Ser Phe Gly 
3805 3810 3815 

teg ccc acg ate ace aag gac ggc gtc ace gtc gee aag gag ate 
Ser Pro Thr He Thr Lys Asp Gly Val Thr Val Ala Lys Glu lie 
3820 3825 3830 

gac etc gac aac aag ttc gag aac atg ggc gcg cag atg gtg aag 
Asp Leu Asp Asn Lys Phe Glu Asn Met Gly Ala Gin Met Val Lys 
3835 3840 3845 

gag gtc gcg teg aag acc tec gac aag gee ggc gac ggc ace acg 
Glu Val Ala Ser Lys Thr Ser Asp Lys Ala Gly Asp Gly Thr Thr 
3850 3855 3860 

acc gee acg gtg ctg gcg cgt gee ate tac gag gag ggc etc aag 
Thr Ala Thr Val Leu Ala Arg Ala He Tyr Glu Glu Gly Leu Lys 
3865 3870 3875 

ctg gtg gee gee ggc cac age ccc atg gac etc aag cgc ggc ate 
Leu Val Ala Ala Gly His Ser Pro Met Asp Leu Lys Arg Gly He 
3880 3885 3890 

gac aag gee gtc gag gtc gtc gtg gag gag ctg aag aag ctg tec 
Asp Lys Ala Val Glu Val Val Val Glu Glu Leu Lys Lys Leu Ser 
3895 3900 3905 

aag ccc acg gcg gac aag aag gee ate acc cag gtg ggc acc ate 
Lys Pro Thr Ala Asp Lys Lys Ala He Thr Gin Val Gly Thr He 
3910 3915 3920 

tec gee aac ggt gat gag acc ate ggc tec ate ate gcg gac gcg 



70720 



70765 



70810 



70855 



70900 



70945 



70990 



71035 



71080 



71125 



71170 
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Ser Ala Asn Gly Asp Glu Thr He Gly Ser He He Ala Asp Ala 
3925 3930 3935 

atg gag aag gtg ggc aag gag ggc gtc ate acc gtc gag gag gec 
Met Glu Lys Val Gly Lys Glu Gly Val He Thr Val Glu Glu Ala 
3940 3945 3950 

aag ggc ctg gag acc aac etc gac gtg gtg gag ggc atg cag ttc 
Lys Gly Leu Glu Thr Asn Leu Asp Val Val Glu Gly Met Gin Phe 
3955 3960 3965 

gac cgc ggc tac gtc tct ccg tac ttc gtg acg aac cgc gag cgc 
Asp Arg Gly Tyr Val Ser Pro Tyr Phe Val Thr Asn Arg Glu Arg 
3970 ^ 3975 3980 

atg gag gtc gtc atg gac gac ccc ttc ate etc ate age gag aag 
Met Glu Val Val Met Asp Asp Pro Phe He Leu He Ser Glu Lys 
3985 3990 3995 

aag gtc teg teg atg cag gac atg att ccc ate ctg gag cag gtg 
Lys Val Ser Ser Met Gin Asp Met He Pro He Leu Glu Gin Val 
4000 4005 4010 

gcg cgc tec ggg aag ccg ctg etc ate ate gee gac gac ate gag 
Ala Arg Ser Gly Lys Pro Leu Leu He He Ala Asp Asp He Glu 
4015 4020 4025 

ggc gag gcg ctg gee acc ctg gtg gtc aac aag ate cgc ggc gtg 
Gly Glu Ala Leu Ala Thr Leu Val Val Asn Lys He Arg Gly Val 
4030 4035 4040 

ctg aac gtg gee gcg gtg aag gcg ccg ggc ttc ggt gac cgc cgc 
Leu Asn Val Ala Ala Val Lys Ala Pro Gly Phe Gly Asp Arg Arg 
4045 4050 4055 

aag gag atg ctg aag gac ate gee acc ctg acg ggc ggc atg gtc 
Lys Glu Met Leu Lys Asp He Ala Thr Leu Thr Gly Gly Met Val 
4060 4065 4070 

gtc age gag gag ctg ggc cac aag tac gag aac ctg acg etc aac 
Val Ser Glu Glu Leu Gly His Lys Tyr Glu Asn Leu Thr Leu Asn 
4075 4080 4085 

gac ctg ggc cgc tec aag cgc ate acg gtg gac aag gac aac acc 
Asp Leu Gly Arg Ser Lys Arg He Thr Val Asp Lys Asp Asn Thr 
4090 4095 4100 

acc ate gtc gac ggc gee ggc acc aag age gag ate gag ggc cgc 
Thr He Val Asp Gly Ala Gly Thr Lys Ser Glu He Glu Gly Arg 
4105 " ~ 4110 4115 

ate aag etc ate cgc teg cag ate gac acc gtc acc age gac tac 
He Lys Leu He Arg Ser Gin He Asp Thr Val Thr Ser Asp Tyr 
4120 ^ 4125 4130 

gac cgc gag aag etc cag gag egg ctg gee aag ctg gtg ggc ggc 
Asp Arg Glu Lys Leu Gin Glu Arg Leu Ala Lys Leu Val Gly Gly 
4135 4140 4145 

gtg gee gtc ate aac gtg ggc gee gcg acc gag acc gag atg aag 
Val Ala Val He Asn Val Gly Ala Ala Thr Glu Thr Glu Met Lys 
4150 4155 4160 
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71215 



71260 



71305 



71350 



71395 



71440 



71485 



71530 



71575 



71620 



71665 



71710 



71755 



71800 
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gag aag aag gcc cgc gtg gag gac gcg ctg cat gcg act cgc gcg 
Glu Lys Lys Ala Arg Val Glu Asp Ala Leu His Ala Thr Arg Ala 
4165 4170 4175 



CT7EP2003/009780 

71890 



gcc gtc gag gag ggc ate gtc cct ggc ggc ggc gtg gcc ttc ctg 
Ala Val Glu Glu Gly He Val Pro Gly Gly Gly Val Ala Phe Leu 
4180 4185 4190 



71935 



cgc tgc ctg ccc gcg ctg gag aag ctg aag ctg ggc ggt gag cag 71980 

Arg Cys Leu Pro Ala Leu Glu Lys Leu Lys Leu Gly Gly Glu Gin 

4195 4200 4205 

gac ttc ggc gtg gag ate ate cgc cgc gcg etc cag gag ccg ctg 72025 

Asp Phe Gly Val Glu He He Arg Arg Ala Leu Gin Glu Pro Leu 

4210 4215 4220 



egg aag ate gcc tec aac gcc ggc gtc gag ggc gcg gtg gtc ate 
Arg Lys He Ala Ser Asn Ala Gly Val Glu Gly Ala Val Val He 
4225 4230 4235 



72070 



aac aag gtc cgc gag ggc cag ggc gcg ttc ggc tac aac gcc cgc 
Asn Lys Val Arg Glu Gly Gin Gly Ala Phe Gly Tyr Asn Ala Arg 
4240 4245 4250 



72115 



ace gag gtc tac gag gac ctg gag aag gcc ggc gtc ate gac ccg 
Thr Glu Val Tyr Glu Asp Leu Glu Lys Ala Gly Val He Asp Pro 
4255 4260 4265 



72160 



acg aag gtc gag cgc ace gcg ctg cag aac gcc gcg tec gtg gcc 
Thr Lys Val Glu Arg Thr Ala Leu Gin Asn Ala Ala Ser Val Ala 
4270 4275 4280 



72205 



teg ctg ctg ctg ace ace gag gcg atg ate gcc gac cgc ccc gcc 
Ser Leu Leu Leu Thr Thr Glu Ala Met He Ala Asp Arg Pro Ala 
4285 4290 4295 



72250 



aag aag aag ggc aag aac ggc ggc ggc gcc ggg atg ccg gac tac 
Lys Lys Lys Gly Lys Asn Gly Gly Gly Ala Gly Met Pro Asp Tyr 
4300 4305 4310 



72295 



ggc ggc gac gac atg gac tac tga gccgctcacg gcggtccggg 
Gly Gly Asp Asp Met Asp Tyr 
4315 4320 



72339 



gecgatgagg 


cctcgggccc 


gccgagccgg 


tgaggtgttc 


cactgatgea 


ggacgcggcc 


72399 


ccggatgcct 


cctcgcgagg 


cgccggggcc 


gtggtctttt 


cgggggtgga 


ggagggcgtc 


72459 


agggcacgeg 


gcccacggcg 


ccgccggtgc 


cgctctcctg 


cgacgtgtag 


aggagggtgc 


72519 


tgccgtccac 


gatgageccg 


cctggtcccg 


tgcccggggg 


ccccaccgga 


tccgacgtcc 


72579 


ccagcgcgca 


cgtgcgcacg 


eggegcagga 


agtccgcgcc 


gtcggcacgt 


gactccttga 


72639 


agtagatggt 


cccattcagt 


tccaccggga 


accacgggcc 


ctggagaccg 


tcggccagca 


72699 


cctcggcggt 


gcctccagcg 


cggggcagcc 


gcaacacccg 


gccggtaccc 


ccaccgccct 


72759 


eggtgacgag 


gaagtgcgtg 


ggggtcacct 


ccagtgaatt 


gecttggagg 


atggtggcgt 


72819 


cccgccgctc 


cgcgggcgcg 


ctgccgtcca 


gcgggacgcg 


gtacagaccc 


gagctgtttc 


72879 


cggctcccgt 


ccccgcgctg 


acgaggaacc 


acacgtccga 


gtccaccact 


cgcgcgccac 


72939 
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gcaccaggga gccgccggga ccggtgtaga gaacctggga gttgccgcca tcggtgtcca 72999 

cccggaccag cgactgccgc cccgtagcga ccaccagcac gtcgcgtccg gcgagcgtgg 73059 

cgcggagcac ctcggtacca ccgacggtga cgttgcgatc cgtcgcgtcg atgggcaggt 73119 

cgccccgctt cccggttgcc ttgtccaccc gccagagccc gtccaggtcg agcacgtaga 73179 

cgctcgtccc atccacggcg atggcgtctg gtgcgcggaa gcccgtggcc agaaccgtgg 73239 

gtggacctcc cgcgcggggc agcctcagaa cctgtcccgg tcccggactc ggctgctggg 73299 

ggttcaggga gt gtg act egg aga tgt aga tgt ccg teg tgt cca ccg 73347 
Val Thr Arg Arg Cys Arg Cys Pro Ser Cys Pro Pro 
4325 4330 



cga gec ggc ggg gcg tgt tea gtc gcg tgg cca get ccg tat egg 
Arg Ala Gly Gly Ala Cys Ser Val Ala Trp Pro Ala Pro Tyr Arg 



4335 

get egg 
Ala Arg 
4350 

cgt cgc 
Arg Arg 
4365 

cat ccg 
His Pro 
4380 



4340 



4345 



ccg egg gcg cct cac age ttc cgc cat cca ccg tgc ccg 

Pro Arg Ala Pro His Ser Phe Arg His Pro Pro Cys Pro 
4355 4360 

ccg etc ccg cat ccg tgc ccg cat egg age egg age cac 

Pro Leu Pro His Pro Cys Pro His Arg Ser Arg Ser His 
4370 4375 

get ccg tgc etc cgt cct ccg teg ggc ccg cgt cct etc 

Ala Pro Cys Leu Arg Pro Pro Ser Gly Pro Arg Pro Leu 
4385 4390 



cgc ccg tec egg agt cct ccg teg cgc ccc cgt ccg gee get ccg 
Arg Pro Ser Arg Ser Pro Pro Ser Arg Pro Arg Pro Ala Ala Pro 
4395 4400 4405 

teg gcg ggg get tgt ccg agg aac agg cca ccg cga aga gca gga 
Ser Ala Gly Ala Cys Pro Arg Asn Arg Pro Pro Arg Arg Ala Gly 
4410 4415 4420 

egg aca gga gcg cga gtc tea tgg gac etc tct ctt gaa ggc gcg 
Arg Thr Gly Ala Arg Val Ser Trp Asp Leu Ser Leu Glu Gly Ala 
4425 4430 4435 

get cag etc acg get gag cgc cgt ggt ate gga cag cag ctt egg 
Ala Gin Leu Thr Ala Glu Arg Arg Gly He Gly Gin Gin Leu Arg 
4440 4445 4450 

aag gca cgc ggc ggc ace ggc gcg cag cga etc cag cgc cgt etc 
L"s Ala Arg Gly Gly Thr Gly Ala Gin Arg Leu Gin Arg Arg Leu 
4455 4460 4465 

cat ggt gag gtg etc ggc cag cac gac gaa ggg ggc gee ctg ggc 
His Gly Glu Val Leu Gly Gin His Asp Glu Gly Gly Ala Leu Gly 
4470 4475 4480 

ras cgc ctt ggc cag etc cag cgc ctt gcg ccc ata ggc egg cgc 
Arc? Leu Gly Gin Leu Gin Arg Leu Ala Pro He Gly Arg Arg 
44b^ 4490 4495 

cas eca 9 ct cac cac cac 9 CC cac cgg cgt cgc cat cgc cat 

Glu Val Pro Ala His His His Ala His Arg Arg Arg His Arg His 
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4510 



cag etc cgt cga ggg cag cac ccg cgc etc cag ccc 
Gin Leu Arg Arg Gly Gin His Pro Arg Leu Gin Pro 
4515 4520 4525 



cag cat gtc 
Gin His Val 



73932 



cag ggc etc gga gat gag ccg 
Gin Gly Leu Gly Asp Glu Pro 
4530 4535 



cgc cgt cgt egg gtt gtc etc cag 
Arg Arg Arg Arg Val Val Leu Gin 
4540 



73977 



cac gtc cac ccg ccg cac ttc 
His Val His Pro Pro His Phe 
4545 4550 



ggc ccc ggg ggc cac 
Gly Pro Gly Gly His 
4555 



cac egg agg 
His Arg Arg 



74022 



cac egg egg etc cga gga cag cgt cgt gcg caa cag 
His Arg Arg Leu Arg Gly Gin Arg Arg Ala Gin Gin 
4560 4565 4570 



ccc gcg cac 74067 
Pro Ala His 



ctg ccg gat gtc gtc gaa ggg ctt gag cag gta gtc 
Leu Pro Asp Val Val Glu Gly Leu Glu Gin Val Val 
4575 4580 4585 



cac cac gec 74112 
His His Ala 



cag etc cag cgc ctg ctg cgt cgt cac cag cga egg gta gee cgt 
Gin Leu Gin Arg Leu Leu Arg Arg His Gin Arg Arg Val Ala Arg 
4590 4595 4600 



74157 



cat cag gat gac gcg cga gtt gga gta gag ccg ccg cgc ctg ctg 
His Gin Asp Asp Ala Arg Val Gly Val Glu Pro Pro Arg Leu Leu 
4605 4610 4615 



74202 



cgc cag etc cag ccc gga cag gee egg cag gtt ctt gtc cgt gac 74247 
Arg Gin Leu Gin Pro Gly Gin Ala Arg Gin Val Leu Val Arg Asp 
4620 4625 4630 

gat gag gtc cac tgg cgc ctg gcg cag caa ate cag cgc ttc etc 74292 
Asp Glu Val His Trp Arg Leu Ala Gin Gin lie Gin Arg Phe Leu 
4635 4640 4645 



gec get ggc cgc etc gat gac etc gca etc ctt gee cat cag gtc 
Ala Ala Gly Arg Leu Asp Asp Leu Ala Leu Leu Ala His Gin Val 
4650 4655 4660 



74337 



gcg gaa gac cat gcg gat gat ggt etc gtc gtc cac 
Ala Glu Asp His Ala Asp Asp Gly Leu Val Val His 
4665 4670 4675 



cac cag cag 743 82 

His Gin Gin 



ccg ctt gcg cag ctg egg egg 
Pro Leu Ala Gin Leu Arg Arg 
4680 4685 



cgc gtc cgt ggc ggg gaa gag tac 
Arg Val Arg Gly Gly Glu Glu Tyr 
469 0 



ccg gaa cac ggt ggc egg egg ggg cac gtc gcg aat ggc gec egg 
Pro Glu His Gly Gly Arg Arg Gly His Val Ala Asn Gly Ala Arg 
4695 4700 4705 



74427 



74472 



cgc cgc gag gec aat cgt cgc 
Arg Arg Glu Ala Asn Arg Arg 
4710 4715 



gtt gtg etc ctg ggc gat gcg ctt 
Val Val Leu Leu Gly Asp Ala Leu 
4720 



74517 



gca cac cgc cag ccc cag tec cgt gee gcg ctt gtt 
Ala His Arg Gin Pro Gin Ser Arg Ala Ala Leu Val 
4725 4730 " 4735 



ggc ggt .gac 
Gly Gly Asp 



74562 



gta ggg etc gaa gat gcg etc gcg cag etc cac ggg gat gec egg 



74607 
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Val Gly Leu Glu Asp Ala Leu Ala Gin Leu His Gly Asp Ala Arg 
4740 4745 4750 

gcc cca gtc cgc cac gta cag cac egg aga ggc gec etc gcg cga 74652 
Ala Pro Val Arg His Val Gin His Arg Arg Gly Ala Leu Ala Arg 
4755 4760 4765 

gag cac cac ctt cac gcg ccc gcg gcc ggc cat ggc gtc gcg cgc 74697 
Glu His His Leu His Ala Pro Ala Ala Gly His Gly Val Ala Arg 
4770 4775 4780 

gtt gtt cag cag gtt cag cgt gag ctg etc gat gag ccg cgc gtt 74742 
Val Val Gin Gin Val Gin Arg Glu Leu Leu Asp Glu Pro Arg Val 
4785 4790 4795 

gcc ctg gat ggt gat gtc etc ggg cgc etc cac etc cag aga cat 74787 
Ala Leu Asp Gly Asp Val Leu Gly Arg Leu His Leu Gin Arg His 
4800 4805 4810 

gcg cgc cga gtc egg gtt gac get gaa cag ctt cgc cgc cgc cca 74832 
Ala Arg Arg Val Arg Val Asp Ala Glu Gin Leu Arg Arg Arg Pro 
4815 4820 4825 

gat ggg cgc cgc cag cga cag gcg ctg ctg egg cgc gtg ccg etc 74877 
Asp Gly Arg Arg Gin Arg Gin Ala Leu Leu Arg Arg Val Pro Leu 
4830 4835 4840 

get ggc gag gcg gat gta gtc gga gac gat ctg ttc cat ccg etc 74922 
Ala Gly Glu Ala Asp Val Val Gly Asp Asp Leu Phe His Pro Leu 
4845 4850 4855 

cac ctg cgc cag cag cag ccg cag egg gcc ggt ggg gcc ace gtc 74967 
His Leu Arg Gin Gin Gin Pro Gin Arg Ala Gly Gly Ala Thr Val 
4860 4865 4870 

etc cgc gag cag ctg cgc gta ggc ctt cac gcc cag cag ggg ctg 75012 
Leu Arg Glu Gin Leu Arg Val Gly Leu His Ala Gin Gin Gly Leu 
4875 4880 4885 

ccg cag etc gtg cag cac etc cgc cgc cag ctg tgc cga gtg cgc 75057 
Pro Gin Leu Val Gin His Leu Arg Arg Gin Leu Cys Arg Val Arg 
4890 4895 4900 

gcc cgc acg ttg cag age cgc cgc cgc cgc gcg cgc ggc egg cag 75102 
Ala Arg Thr Leu Gin Ser Arg Arg Arg Arg Ala Arg Gly Arg Gin 
4905 4910 4915 

ate tec cgc etc cag ggc ctg gag caa ctg gga gag egg cgc ggg 75147 
lie Ser Arg Leu Gin Gly Leu Glu Gin Leu Gly Glu Arg Arg Gly 
4920 4925 4930 

tgt ttc cat ggc ccg age atg gcg gag gca gcg gta tct acg caa 75192 
Cys Phe His Gly Pro Ser Met Ala Glu Ala Ala Val Ser Thr Gin 
4935 4940 4945 

cgc ggg egg agg gat tec gtt gac ccg ggc ccc tgg ccg cgc gga 75237 
Arg Gly Arg Arg Asp Ser Val Asp Pro Gly Pro Trp Pro Arg Gly 
4950 4955 4960 

tgc tec teg cgt ccc ctg gcg ccc cgt cca gaa cgt gga cgc cac 75282 
Cys 5 <ar Ser Arg Pro Leu Ala Pro Arg Pro Glu Arg Gly Arg His 
4965 4970 4975 
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cac egg cct egg gag ggc ctt cat gec gca gac caa ccc 
His Arg Pro Arg Glu Gly Leu His Ala Ala Asp Gin Pro 
4980 4985 4990 

etc get ggt gee ccg gaa gat gac cga etc cga get ggc 
Leu Ala Gly Ala Pro Glu Asp Asp Arg Leu Arg Ala Gly 
4995 5000 5005 

cat ccg get gaa cat cga ggc gga get gga cgc cat caa 
His Pro Ala Glu His Arg Gly Gly Ala Gly Arg His Gin 
5010 5015 5020 



gtt cca 
Val Pro 



ccg etc 
Pro Leu 



cct eta 
Pro Leu 
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75327 



cgc cgc cca cat cga tgc cac cga caa cga gga ggc caa ggc cat 

Arg Arg Pro His Arg Cys His Arg Gin Arg Gly Gly Gin Gly His 
5025 5030 5035 

cct ccg tea cgt cat gga cga gga gcg cga gca cgc cgc cct ctt 

Pro Pro Ser Arg His Gly Arg Gly Ala Arg Ala Arg Arg Pro Leu 
5040 5045 5050 

ctg gca get gat tgc ccg gtt gga ccc cga gca ggc cca gca tga 

Leu Ala Ala Asp Cys Pro Val Gly Pro Arg Ala Gly Pro Ala 



75372 



75417 



75462 



75507 



75552 



5055 




5060 




5065 






ccgcgaggcc 


teggagaagt 


accggctcat 


caccaccggt 


gccccgcacg 


aggaggtgga 


75612 


ggcggtgggc 


aaggagggcg 


ggggagcacg 


cgccgaggtg 


gagaccaccc 


cgaccaagcg 


75672 


cctcaccgtg 


ggcaacctgc 


gccgctgagc 


cgctacgtgc 


tggagccgcc 


ggggccgctg 


75732 


ccgccctggc 


gccgctccgc 


cggcagcggc 


tccagatcca 


gggccgccat 


gtagacgaca 


75792 


ttcegggage 


cacgcttgcc 


ccggtacatg 


gcccggtccg 


ccaggtccag 


cagcgtggcc 


75852 


ttgtcccggg 


cgtgctcggg 


gaagctggcc 


acgccgatgc 


aegtgetgag 


cttcagcgac 


75912 


agcccctcgc 


gcgcgaggaa 


caggtgcgtc 


tccatggtgc 


ggcggatgcg 


ctccgccacc 


75972 


ttgagegege 


cgccggagtc 


ggtgttgcgc 


aggagcacca 


cgtactcgtc 


gccgccgtag 


76032 


cgggccacca 


cgtcgtggtc 


tctcacgcag 


cccttcacca 


cccgggccgt 


ctccaccaac 


76092 


agccgcgagc 


ccacgaggtg 


gccgtgcgtg 


tcgttgatgg 


acttgaagtg 


gtccaggtcc 


76152 


aggaacagca 


ggctgaaggg 


gcgctgcgtc 


tggacggagt 


cctggacctc 


gcggtccagc 


76212 


accaggtgea 


ggtagcgcgt 


gttgaagagg 


cgggtgaggt 


cgtcgacgta 


cgccaggtcc 


76272 


tccaccgccg 


cgaagcgccc 


gaggttgege 


agegecagtg 


cccagttgcg 


caccaggtag 


76332 


cccgccgtct 


cgccggtcca 


ctccggctcg 


gcgccgtcga 


agaagagcac 


cgcgtggccc 


76392 


agcaccgtgg 


tgccctccac 


ggcggggatg 


gagaggaege 


gegggaagge 


ggcgtccagc 


76452 


ccttccagct 


cccggggcgc 


gcgcgcgttc 


gagaggeget 


cgacgagggt 


ggccaccagc 


76512 


tccggctcgt 


gecgeggagg 


cagtccgcgg 


acgccctgga 


gccgcaggcc 


cttgtcgccg 


76572 


tcgcgctcca 


gcagcaccac 


ggcgctggcg 


caggecatgg 


actccagcgc 


gctcgtggtg 


76632 


geggaggeca 


gcttctcgcg 


ctccagcgtg 


gtggcgatgc 


gctggccggc 


ctccagcatg 


76692 


gcgacgtgcc 


ggegcagega 


ggcgttctcc 


tggaggaggt 


ctcgcgtggt 


gagggcaegg 


76752 
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cggacggcat 


gc t crcracrorac 

9 '*** 53 53 *^53 53 9 *"■* 


etc aacracrc c 


aceggcttga 


cgaggtactc 


ggccgcgccg 


76812 


ctcttgatgg 


cgcgcacggc 


ggggtccacc 


ttctccaggc 


eggtgatgae 


caccacctcc 


76872 


acgccggggt 


ggcgctcgcg 


cacgtggcgc 


agcacctcca 


tgccgtcgcc 


geeggggaaa 

3 ^9999 *"*9 9 


76932 


atgaggtccg 


tcaccaccgc 


gtcgaagegg 


tccccggcca 


gcgcctcccg 


cacatcctaa 


76992 


agcgagccca 


ccgccgtgac 


ggcgtgcccc 

3 53 3 3 


acggccgtga 

3 3 53 9 


ggtagtcgee 


gtagagggta 

9 ww* 53 <— <- 3 53 3 ^ 53 


77052 


cgggcgatct 


tttcgtcgtc 


aacaaaaaga 


atgcgcgcca 


tctggctcga 


tggcttacca 


77112 


ccaaggcgag 


gaaggctgct 


agggtagggc 

933 3 3 3^ 


cgtgtcaccc 


ttcgaaagcg 


gacgccggct 


77172 


ctgccttctc 


gtggaggcag 


gagagacgeg 


etacgeggtg 

3 3 3 w 3 


gaggccacct 


ccgtcatgga 


77232 


agtggccatg 


cccggcgcca 


acggctccag 


cctgcgtggc 


gtgctggagg 


t gaaggac c t 


77292 


gtcggccctg 


ctcggcggct 


tgcccgagga 


ggggc eggge 

3333 ww 3353 


ataatgotga 

>»"■ w 3 9 3 3 3 3 


tgctcgacgt 


77352 


gagtcccacc 


ctggccgtac 


gagtacactc 


catggtggag 

53 ^ 53 53 53 3 ^53 


gtcgccgacg 


tggegeggge 

^39^9^-'339 v ' 


77412 


gccgttcttc 


ctcctgcccc 


ccgggctggc 


goactegcta 


gcccccctga 


ccccrgggcac 


77472 


ggtgctgcac 


aaggagcggc 

- - 3 3 ~~3 S3 9 


tctacctgga 


gctcatcgcc 


aaggcgctgc 

53 ^53 53 ^ 53 ^ ^ 53 ^ 


cccaccaaat 


77532 


gggccctcgt 


gccgcgcccg 


cggccctgcg 


ccccgtccac 


tggaccgagt 

^*53539 v '^*9'-*53 *- 


ccacgcccga 


77592 


caaggcgctc 


gtcttcgaat 


cgcaggggcg 

3" ***3 9 9 9 v 9 


cctgttcggc 


gtcccgctgg 


gcttcgtctc 


77652 


ccaggtgata 


t cgcgagggg 

3 3 wk 9 9 9 3 


aggecttcag 


cgtcctgccc 


gtgeagagea 


ggccggtggc 


77712 


cggcatcttc 


ccgcacgccc 


acrat aetata 

53 3 53 ^" 53 53 


gcccatctgc 


tcaatcccca 


cgctgctggg 

23 ^3 ^* 23 23 23 


77772 


ctcccccccg 


gtacccgagg 


ccctcttcat 


cctcacggag 


ctgaccgacc 


ggaacgtggg 

23 23 v *"" 23 23 C3 23 


77832 


gctgacggcc 


acccgggtgc 


tcaacatact 


gcagcacttc 


23 23 23 J3 2? 


ccaccattaa 


77892 


agccttccgt 


gcccccgggt 


tgacaggctc 


cgtgctgttc 


ctggacctgc 


agcgcatgtt 


77952 


ttcttgatcg 


tgctggagcg 


egaaaegtaa 


gctgccccgc 


ttgatacagg 


gccgcgacct 


78012 


gggtttttcg 


cggcaggaat 


ctccaacgaa 


ttcagggggg 


tgggacgccc 


cccgaggccc 


78072 


gaaccgatgc 


ccaagaatct 


gctggtcgcc 


gatgactege 


tcaccatccg 


caaggtgatt 


78132 


gggatgatct 


tcgcgaccga 


ggactttcag 


gtg acc gcg gtg gac aac ggg 
Val Thr Ala Val Asp Asn Gly 


78183 



5070 



ctg gac gec ate tec cgt acc cgc gag ctg cgc ccg gac gtc gtc 
Leu Asp Ala lie Ser Arg Thr Arg Glu Leu Arg Pro Asp Val Val 
5075 5080 5085 



78228 



etc gec gat gtg atg atg ccg ggc aag age ggc tac gag gtc tgc 
Leu Ala Asp Val Met Met Pro Gly Lys Ser Gly Tyr Glu Val Cys 
5090 5095 5100 



78273 



gag gee ctg aag age gac ccg gec acc cag ggc ate ccc gtg gtg 
Glu Ala Leu Lys Ser Asp Pro Ala Thr Gin Gly lie Pro Val Val 
5105 5110 5115 



78318 
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ctg ctg gcg ggg acc ttc gag gcc ttc gac gag aac cgc gcc cgc 783 63 

Leu Leu Ala Gly Thr Phe Glu Ala Phe Asp Glu Asn Arg Ala Arg 
5120 5125 5130 



gcc gcc cgc gcc gat gac cac ate gcc aag ccc ttc gag age cag 78408 
Ala Ala Arg Ala Asp Asp His lie Ala Lys Pro Phe Glu Ser Gin 
5135 5140 5145 



gtg ctg etc gac aag gtg aag gcg ctg gtg ggg cag aag tec aac 78453 
Val Leu Leu Asp Lys Val Lys Ala Leu Val Gly Gin Lys Ser Asn 
5150 5155 5160 



acc atg ccg gcc tec gcg gcc acg cag gtg cgc ccg ccc acc gcc 78498 
Thr Met Pro Ala Ser Ala Ala Thr Gin Val Arg Pro Pro Thr Ala 
5165 5170 5175 



cag ccg gcc act ccc gcg ccg gtg cct ccg ggc ccg cag ccg ggc 78543 
Gin Pro Ala Thr Pro Ala Pro Val Pro Pro Gly Pro Gin Pro Gly 
5180 5185 5190 



gcg ccc acg gcc gcc cgt ccg ggc att ccg cag ggc acc cag ccg 78588 
Ala Pro Thr Ala Ala Arg Pro Gly lie Pro Gin Gly Thr Gin Pro 
5195 5200 5205 



ggc gtg ccg cgt cct ccg ggc ccg ggc atg ccg ccg ggt get ccg 78633 
Gly Val Pro Arg Pro Pro Gly Pro Gly Met Pro Pro Gly Ala Pro 
5210 5215 5220 



ggc gcg ccg cgt cct ccg ggg cct gga atg ccg ccg ggg ccg ggc 78 678 

Gly Ala Pro Arg Pro Pro Gly Pro Gly Met Pro Pro Gly Pro Gly 
5225 5230 5235 



atg gcc cgt cct ccc ggg cct gga atg ccg cct ccg ggt get ccg 78723 
Met Ala Arg Pro Pro Gly Pro Gly Met Pro Pro Pro Gly Ala Pro 
5240 5245 5250 



ggc gcg ccg cgt cct ccg ggg cct gga atg ccg ccg gga ccg ggc 78768 
Gly Ala Pro Arg Pro Pro Gly Pro Gly Met Pro Pro Gly Pro Gly 
5255 5260 5265 



atg gcc cgc ccg ccg ggg cct gga atg ccg ccg ccg ggt get ccg 78813 
Met Ala Arg Pro Pro Gly Pro Gly Met Pro Pro Pro Gly Ala Pro 
5270 5275 5280 



ggc gcg ccg cgt cct ccg ggc ccg ggc atg ccg cct ccg ggc gcc 78858 
Gly Ala Pro Arg Pro Pro Gly Pro Gly Met Pro Pro Pro Gly Ala 
5285 5290 5295 



gcc gcg cgt ccc ggg cgt gcc gcc tec gcc ggg tgc gtc cgc gcc 78903 
Ala Ala Arg Pro Gly Arg Ala Ala Ser Ala Gly Cys Val Arg Ala 
5300 5305 5310 



ggg cat gcc gcc gcg tec ggg cgt gcc cgc gcc tec ggg tgg gcc 78948 
Gly His Ala Ala Ala Ser Gly Arg Ala Arg Ala Ser Gly Trp Ala 
5315 5320 5325 



ggc gcc ggc atg gcc cgt ccg ggg att ccc cag ggc age cag ccg 78993 
Gly Ala Gly Met Ala Arg Pro Gly lie Pro Gin Gly Ser Gin Pro 
5330 5335 5340 



ggc ttc ccg cgt cct ccg ggc ggc gcc gcg ccg ctg cct ccc gcg 79038 
Gly Phe Pro Arg Pro Pro Gly Gly Ala Ala Pro Leu Pro Pro Ala 
5345 5350 5355 
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ccg ggt ggc gcg acc cag ccg gcg gcc cgc gcg agg gac ccg ttc 
Pro Gly Gly Ala Thr Gin Pro Ala Ala Arg Ala Arg Asp Pro Phe 
5360 5365 5370 



79083 



ggc ctg ggc get ccc acg gcg gcg ccg cag gca gcc cag cct tec 
Gly Leu Gly Ala Pro Thr Ala Ala Pro Gin Ala Ala Gin Pro Ser 
5375 5380 5385 



79128 



ate age ate gag gac teg ctg ccc gag ccc age ggt gcc gag gag 
lie Ser lie Glu Asp Ser Leu Pro Glu Pro Ser Gly Ala Glu Glu 
5390 5395 5400 



79173 



ate tec ctg gac atg ggc ggc ccc ccg acg gcc get ccg acc gcg 
lie Ser Leu Asp Met Gly Gly Pro Pro Thr Ala Ala Pro Thr Ala 
5405 5410 5415 



79218 



gtg gcg ccg acc gcc get ccg gcc acg gcc cgt ccg acc gcc gcg 
Val Ala Pro Thr Ala Ala Pro Ala Thr Ala Arg Pro Thr Ala Ala 
5420 5425 5430 



79263 



gat ggt ggc gag gcg ctg ctg cgc gag gcg etc teg aag gcc tec 
Asp Gly Gly Glu Ala Leu Leu Arg Glu Ala Leu Ser Lys Ala Ser 
5435 5440 5445 



79308 



cgc gag gtc ate gag aag att gcc tgg gag gtg gta ccg cag ctg 
Arg Glu Val He Glu Lys He Ala Trp Glu Val Val Pro Gin Leu 
5450 5455 5460 



79353 



gcg gag acc ate ate cgt gag gag etc gag egg etc ate aag gac 
Ala Glu Thr He He Arg Glu Glu Leu Glu Arg Leu He Lys Asp 
5465 5470 5475 



79398 



egg gag acg cag cac tga cgcgttcctc gccttaacct ctctccaaga 

Arg Glu Thr Gin His 

5480 



79446 



ccttcccgcc ggccccggtt cccccggggc tggccgtccg caatgaccga caccactgaa 79506 

ctgtccaagg cctacgagcc caccgaggtc gaggcccgct ggtacgcctg ctgg atg 79563 

Met 
5485 



gag egg aac tac ttc cgc gcc gag gcg acc tec gac aag ccc gcc 
Glu Arg Asn Tyr Phe Arg Ala Glu Ala Thr Ser Asp Lys Pro Ala 
5490 5495 5500 



79608 



ttc tec ate gtg ctg ccg ccg ccc aac gtg acg ggc age etc cac 
Phe Ser He Val Leu Pro Pro Pro Asn Val Thr Gly Ser Leu His 
5505 5510 5515 



79653 



ggc cac gcg etc acc gcc acc ate cag gac ate etc gcg cgc 
He Gly His Ala Leu Thr Ala Thr He Gin Asp He Leu Ala Arg 
5520 5525 5530 



79698 



tgg aag egg atg age ggc ttc aac acc ctg tgg etc ccc ggc acg 
Tm Lvfl Arg Met Ser Gly Phe Asn Thr Leu Trp Leu Pro Gly Thr 
5535 5540 5545 



79743 



~ac cac gcc ggc ate gcc acg cag atg gtg gtg gag aag gag ctg 
His Ala Gly He Ala Thr Gin Met Val Val Glu Lys Glu Leu 
5550 5555 5560 



79788 
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aag aag acc gag ggc aag age egg cac gac etc ggc cgc gee gec 
Lys Lys Thr Glu Gly Lys Ser Arg His Asp Leu Gly Arg Ala Ala 
5565 " 5570 5575 
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ttc ctg gag cgc gtc tgg gag tgg aag ggc 
Phe Leu Glu Arg Val Trp Glu Trp Lys Gly 
5580 5585 



aag tac ggc gee cgc 
Lys Tyr Gly Ala Arg 
5590 



79878 



ate ggc gag cag cac cgc ttc ctg ggc gee teg ctg gac tgg age 
lie Gly Glu Gin His Arg Phe Leu Gly Ala Ser Leu Asp Trp Ser 
5595 5600 5605 



79923 



cgc gag cgc ttc acc atg gac gag cag tec tec gee gee gtg cgc 
Arg Glu Arg Phe Thr Met Asp Glu Gin Ser Ser Ala Ala Val Arg 
5610 5615 5620 



79968 



gag gtc ttc gtg cgc 
Glu Val Phe Val Arg 
5625 



ctg tac gaa gag ggc ctg atg tac egg gee 
Leu Tyr Glu Glu Gly Leu Met Tyr Arg Ala 
5630 5635 



80013 



cag aag etc ate aac tgg tgc ccc teg tgc cgc acc gee etc age 
Gin Lys Leu lie Asn Trp Cys Pro Ser Cys Arg Thr Ala Leu Ser 
5640 5645 5650 



80058 



gac ttg gag gtg gag 
Asp Leu Glu Val Glu 
5655 



cac gag gag teg aag aag ggc tec ate tgg 
His Glu Glu Ser Lys Lys Gly Ser lie Trp 
5660 5665 



80103 



cac ate cgc tac ccc gtc aag gac age gac cgc acg etc acc gtg 
His lie Arg Tyr Pro Val Lys Asp Ser Asp Arg Thr Leu Thr Val 
5670 5675 5680 



80148 



gec acc aca cgc ccg 
Ala Thr Thr Arg Pro 
5685 



gag acg atg ctg ggc gac acc gcg gtg gec 
Glu Thr Met Leu Gly Asp Thr Ala Val Ala 
5690 5695 



80193 



ate cac ccg gag gac gag cgc tac ctg ggg ctg gee ggc aag age 
He His Pro Glu Asp Glu Arg Tyr Leu Gly Leu Ala Gly Lys Ser 
5700 5705 5710 



80238 



gtg aag ctg ccg ctg acc gac cgc gag att ccc ate ate gcg gac 80283 
Val Lys Leu Pro Leu Thr Asp Arg Glu He Pro He He Ala Asp 
5715 5720 5725 

gcg gag ctg gtg gac ccg aag ttc ggc acc ggc gtg gtg aag gtg 80328 
Ala Glu Leu Val Asp Pro Lys Phe Gly Thr Gly Val Val Lys Val 
5730 5735 5740 



acg ccg gcg cac gac 
Thr Pro Ala His Asp 
5745 



ttc aac gac tac cag acg ggc ctg egg cac 
Phe Asn Asp Tyr Gin Thr Gly Leu Arg His 
5750 5755 



80373 



aag ctg ccc atg etc acc ate ctg gac gaa gcg gee egg atg acg 
Lys Leu Pro Met Leu Thr He Leu Asp Glu Ala Ala Arg Met Thr 
5760 5765 5770 



80418 



aag gag acc ggc aag 
Lys Glu Thr Gly Lys 
5775 



tac gee ggc atg gat cgc ttc gag gcg cgc 
Tyr Ala Gly Met Asp Arg Phe Glu Ala Arg 
5780 5785 



80463 



aag cag gtg ctg gcg gac etc acg gag cag ggg ctg ctg gag aag 
Lys Gin Val Leu Ala Asp Leu Thr Glu Gin Gly Leu Leu Glu Lys 
5790 5795 5800 



80508 
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gaa gag ccc cac atg ctg aac gtg ggc acc tgc cag cgc tgc gcc 
Glu Glu Pro His Met Leu Asn Val Gly Thr Cys Gin Arg Cys Ala 
5805 5810 5815 

acg gtg gtg gag cca cgc ctg tct ccg cag tgg ttc ate aag att 
Thr Val Val Glu Pro Arg Leu Ser Pro Gin Trp Phe He Lys lie 
5820 5825 5830 

gaa ccg ctg gcg aag ccg gcc ate gag gcg gtg gag cag ggc cgc 
Glu Pro Leu Ala Lys Pro Ala He Glu Ala Val Glu Gin Gly Arg 
5835 5840 5845 

acg aag ttc gtc ccc gag tec tgg acg aac acg ttc ttc cac tgg 
Thr Lys Phe Val Pro Glu Ser Trp Thr Asn Thr Phe Phe Ha.s Trp 
5850 5855 5860 

atg cgc aac ate cac gac tgg tgc gtc age cgc cag ctg tgg tgg 
Met Arg Asn He His Asp Trp Cys Val Ser Arg Gin Leu Trp Trp 
5865 5870 5875 

ggc cac cag att ccc gcg tac tac tgc acc teg tgc age ccg egg 
Gly His Gin He Pro Ala Tyr Tyr Cys Thr Ser Cys Ser Pro Arg 
5880 5885 5890 

cag ggc gac gac acg gac ctg ccg ctg gac gcc ccc acg gtg aag 
Gin Gly Asp Asp Thr Asp Leu Pro Leu Asp Ala Pro Thr Val Lys 
5895 5900 5905 

gtg ggc ggg gtg gac ttc gcg cgc gcg gag ccc ate gtc gcg cgc 
Val Gly Gly Val Asp Phe Ala Arg Ala Glu Pro He Val Ala Arg 
5910 5915 5920 

gag cag ccg aag tec tgc ccg aag tgc ggc ggc gcg tec ttc ate 
Glu Gin Pro Lys Ser Cys Pro Lys Cys Gly Gly Ala Ser Phe He 
5925 5930 5935 



cag gac ccg gac gtg ctg gac acc tgg ttc 
Gin Asp Pro Asp Val Leu Asp Thr Trp Phe 
5940 5945 



teg tec ggc ctg tgg 
Ser Ser Gly Leu Trp 
5950 



ccg ttc tec acg ctg ggc tgg ccg cgt gag acg ccg gag ctg aag 
Pro Phe Ser Thr Leu Gly Trp Pro Arg Glu Thr Pro Glu Leu Lys 
5955 5960 5965 

acc ttc tac ccg acg tec gtc atg gag acg ggc cac gac ate ate 
Thr Phe Tyr Pro Thr Ser Val Met Glu Thr Gly His Asp He lie 
5970 5975 5980 

ttc ttc tgg gtc gcc egg atg atg atg atg ggc ate cac ttc atg 
Phe Phe Trp Val Ala Arg Met Met Met Met Gly He His Phe Met 
5985 5990 5995 

ggg gat gtg ccc ttc cgc acc gtg tac ctg cac gcg atg gtg cgc 
Gly Asp Val Pro Phe Arg Thr Val Tyr Leu His Ala Met Val Arg 
6000 6005 6010 

gac gag aag ggc cag aag atg tec aag acg aag ggg aac gtc ate 
Aso Glu Lys Gly Gin Lys Met Ser Lys Thr Lys Gly Asn Val lie 
6015 6020 6025 

gac ccc ctg gac ate gtc etc ggc gcc ccg gcc gac aag etc cag 
Asp Pro Leu Asp He Val Leu Gly Ala Pro Ala Asp Lys Leu Gin 



80553 



80598 



80643 



80688 



80733 



80778 



80823 



80868 



80913 



80958 



81003 



81048 



81093 



81138 



81183 



81228 
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6035 



6040 



ccg acg ctg aag aac egg ttc ccg cag ggc atg ccg ccg ttc ggc 
Pro Thr Leu Lys Asn Arg Phe Pro Gin Gly Met Pro Pro Phe Gly 
6045 6050 6055 

gcg gac gcg ctg cgc ttc acg etc gec teg etc ace cag cag ggc 
Ala Asp Ala Leu Arg Phe Thr Leu Ala Ser Leu Thr Gin Gin Gly 
6060 6065 6070 

cgc gac ate aag ctg teg atg gac egg etc gag ggc tac aag gee 
Arg Asp lie Lys Leu Ser Met Asp Arg Leu Glu Gly Tyr Lys Ala 
6075 6080 6085 

ttc ggc aac aag ctg tgg aac gee age cgc ttc gee ctg atg aac 
Phe Gly Asn Lys Leu Trp Asn Ala Ser Arg Phe Ala Leu Met Asn 
6090 6095 6100 

atg ggc gac ttc egg atg gag gee ggc ccg ctg gac egg age aca 
Met Gly Asp Phe Arg Met Glu Ala Gly Pro Leu Asp Arg Ser Thr 
6105 6110 6115 
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81273 



ctg acg ctg gcg gac cgc tgg att etc tec 
Leu Thr Leu Ala Asp Arg Trp lie Leu Ser 
6120 6125 



cgc etc cag cgc gee 
Arg Leu Gin Arg Ala 
6130 



acc acg gag gcg cgc gcg ctg ctg gag teg tac age ttc ggc gag 
Thr Thr Glu Ala Arg Ala Leu Leu Glu Ser Tyr Ser Phe Gly Glu 
6135 6140 6145 



gec gee tec acg etc tac cag ttc etc tgg 
Ala Ala Ser Thr Leu Tyr Gin Phe Leu Trp 
6150 6155 



gcg gag ttc tgc gac 
Ala Glu Phe Cys Asp 
6160 



tgg tac ate gag ctg 
Trp Tyr lie Glu Leu 
6165 



gcg aag ggc teg ctg tac ggc gag gac gcg 
Ala Lys Gly Ser Leu Tyr Gly Glu Asp Ala 
6170 6175 



gcg gcg aag gac tec acg cgc egg gtg etc gtg tac tgc ctg gac 
Ala Ala Lys Asp Ser Thr Arg Arg Val Leu Val Tyr Cys Leu Asp 
6180 6185 6190 

cgc ate ctg egg ctg ctg cac ccg ttc atg ccg ttc ate acc gag 
Ara He Leu Arg Leu Leu His Pro Phe Met Pro Phe He Thr Glu 
6195 6200 6205 

gag ate tgg cag aag ctg ccg atg gee egg ccg acg gac age ate 
Glu He Trp Gin Lys Leu Pro Met Ala Arg Pro Thr Asp Ser He 
6210 6215 6220 



atg att gee ccg tac 
Met He Ala Pro Tyr 
6225 



ccg gag ccg gac gcg gcg cac gtg gac gag 
Pro Glu Pro Asp Ala Ala His Val Asp Glu 
6230 6235 



gcg gcg gaa ggc gag atg gcg ccg gtc ate gee gee ate gag ggc 
Ala Ala Glu Gly Glu Met Ala Pro Val He Ala Ala He Glu Gly 
6240 6245 6250 

ctg cgc acc ate cgc ggc gag age aac ctg teg ccc gee acc aag 
Leu Arg Thr He Arg Gly Glu Ser Asn Leu Ser Pro Ala Thr Lys 
6255 6260 6265 



81318 



81363 



81408 



81453 



81498 



81543 



81588 



81633 



81678 



81723 



81768 



81813 



81858 



81903 



gtg aag gcg gtg gtg cag age gcg gac gtg cgc acg cgc gag ctg 



81948 
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Gin Ser Ala Asp Val Arg Thr Arg Glu Leu 
6275 6280 



ctg gag cgc tgg cgc 
Leu Glu Arg Trp Arg 
6285 



ggc tac etc atg ccg ctg gec ggc ctg tec 
Gly Tyr Leu Met Pro Leu Ala Gly Leu Ser 
6290 6295 



81993 



gag gtg acg gtg gga 
Glu Val Thr Val Gly 
6300 



gcg ccg ggc gec aag 
Ala Pro Gly Ala Lys 
6305 



ccg ccg cag gec gec 
Pro Pro Gin Ala Ala 
6310 



82038 



gec ttc gtg ggc tec 
Ala Phe Val Gly Ser 
6315 



aac ctg gag ate tac gtc ccg ctg gcg ggc 
Asn Leu Glu lie Tyr Val Pro Leu Ala Gly 
6320 6325 



82083 



etc gtc gac ctg gac 
Leu Val Asp Leu Asp 
6330 



gcg gag cgc gag egg ttg aag aag gag att 
Ala Glu Arg Glu Arg Leu Lys Lys Glu lie 
6335 6340 



82128 



gec cgc gec gag cag 
Ala Arg Ala Glu Gin 
6345 



gag atg gee ggc gtg ctg cgc aag ctg gag 
Glu Met Ala Gly Val Leu Arg Lys Leu Glu 
6350 6355 



82173 



aac ccc aac ttc gtg 
Asn Pro Asn Phe Val 
6360 



gee aag gcg ccc ccg gac gtg gtg gag aag 
Ala Lys Ala Pro Pro Asp Val Val Glu Lys 
6365 6370 



82218 



gac cgc gec cgc gtc 
Asp Arg Ala Arg Val 
6375 



gag gag ctg aag gag cgc aag gee aag ctt 
Glu Glu Leu Lys Glu Arg Lys Ala Lys Leu 
6380 6385 



82263 



cag gac cac ctg cag 
Gin Asp His Leu Gin 
6390 



egg att gec ccg gag ccc gee atg cca gag 
Arg lie Ala Pro Glu Pro Ala Met Pro Glu 
6395 6400 



82308 



acg ccg ccg tec gag 
Thr Pro Pro Ser Glu 
6405 



age acc act ccc gag age age ace ccc ace 
Ser Thr Thr Pro Glu Ser Ser Thr Pro Thr 
6410 6415 



82353 



gcg gaa gag ggt gec tec ccc tec gee ccg gee cag gtg aag gtc 
Ala Glu Glu Gly Ala Ser Pro Ser Ala Pro Ala Gin Val Lys Val 
6420 6425 6430 



82398 



gee ccg aat cag gaa 
Ala Pro Asn Gin Glu 
6435 



gag aag ggc ggc gtg gac ctg ggc cag gag 
Glu Lys Gly Gly Val Asp Leu Gly Gin Glu 
6440 6445 



82443 



ctg aag ggt gaa ctg ggc gag acc ggc acc gcg tec gag gec ggg 
Leu Lys Gly Glu Leu Gly Glu Thr Gly Thr Ala Ser Glu Ala Gly 
6450 6455 6460 



82488 



gac acg cag gtg cag gac gcg etc gcg aag ctg cgc gag ggc acg 
Asp Thr Gin Val Gin Asp Ala Leu Ala Lys Leu Arg Glu Gly Thr 
6465 6470 6475 



82533 



aag gag ggc ctg teg cca tec gac cac cat gac etc ggc gtg gcg 
Lys Glu Gly Leu Ser Pro Ser Asp His His Asp Leu Gly Val Ala 
6480 6485 6490 



82578 



tac atg acc atg ggg ctg gtg gac gac gcg atg cgc gag ttc aac 
Tyr Met Thr Met Gly Leu Val Asp Asp Ala Met Arg Glu Phe Asn 
6495 6500 6505 



82623 
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egg gec aag gag ggt ggc gac acc cgc gag acg ccg gec gag ggc 
Arg Ala Lys Glu Gly Gly Asp Thr Arg Glu Thr Pro Ala Glu Gly 
6510 6515 6520 

gec gag gcg gag gag ggt gag tec ggc ccg gtg aag gee cac ggg 
Ala Glu Ala Glu Glu Gly Glu Ser Gly Pro Val Lys Ala His Gly 
6525 6530 6535 

gac atg ctg ccc acg cgc gag gtg acg cag ggc gag ccc gtg cgc 
Asp Met Leu Pro Thr Arg Glu Val Thr Gin Gly Glu Pro Val Arg 
6540 6545 6550 

tec gec teg aag aag aag gcg acc get ccc ggc aag gtg cac ggc 
Ser Ala Ser Lys Lys Lys Ala Thr Ala Pro Gly Lys Val His Gly 
6555 6560 6565 

gac aag ctg gcg ccc gee ccg acc acc gag gag gee gag gee ccc 
Asp Lys Leu Ala Pro Ala Pro Thr Thr Glu Glu Ala Glu Ala Pro 
6570 6575 6580 

gcg gtg aag aag gcg gee gc 
Ala Val Lys Lys Ala Ala 
6585 



82668 



82713 



82758 



82803 



82848 



82868 



<210> 19 

<211> 358 

<212> PRT 

<213> Angiococcus discif ormis 

<400> 19 

Val Gly Arg Gly Leu Gin Pro Leu Pro Pro Asp Ala Pro His Pro Arg 
1 5 10 15 



Pro Ala Arg Cys Glu Gly Arg Arg Trp Gin Gly Leu Gly Ala His Gly 
20 25 30 



Gin Pro His Leu Arg Pro Arg Arg Gly Glu Ser Gly Pro Gly Arg Cys 
35 40 45 



Arg Asp Gly Ser His Arg Glu Gly Thr His Pro Pro Gin Ser Arg Ser 
50 55 60 



Tyr Leu Ala Met Thr Ala Asp Ser Met Asp Ala Thr Val Pro Ser Arg 
65 70 75 80 



Arg Pro Ala Leu Gly Phe lie Gly Gly Gly Val Met Ala Glu Ala Leu 
85 90 95 



lie Gly Ala Val Thr Gly Gly Gly Leu Ala Ala Pro Gly Thr Leu Leu 
100 "* "* 105 110 



WO 2004/022586 ^W>CT/EP2003/009780 
^ 200 

Val Gly Glu Val Asp Glu Glu Arg Arg Arg Val Leu Arg Glu Gin Tyr 
115 120 125 

Gly Val Glu Val Thr Ala Asp Asn Leu Ala Ala Ala Asp Ala Glu Leu 
130 135 140 

Val Val Leu Ala Val Lys Pro Gin His Leu Gly Val Val Leu Gin Gin 
145 150 155 160 

Leu Arg Gly Gin Leu Arg Glu Asp Thr He Val Val Ser He He Ala 
165 170 175 

Gly Ala Arg Leu Gin Thr Leu Cys Glu Gly Leu Glu His Pro Leu Val 
180 185 190 

Val Arg Ala Met Pro Asn Thr Pro Ala Arg Val Arg Lys Ala Ala Thr 
195 200 205 

Phe Trp He Pro Ser Ala Gly Leu Ser Glu Pro Ala Leu Val Arg Val 
210 215 220 

Arg Ser Leu Leu Gly Ala Leu Gly Thr Glu Val Glu Val Ala Asp Glu 
225 230 235 240 

Ala Gly Val Glu Met Ala Thr Gly Leu Ala Gly Pro Met Pro Ala Phe 
245 250 255 

Val Tyr Phe Leu He Glu Ala Phe He Asp Ala Gly Val Ala Leu Gly 
260 265 270 

Leu Pro Arg Asp Gin Ala Thr Leu Ala Thr Val Glu Ser Met Arg Gly 
275 280 285 

Ser Leu Glu Leu Leu His Arg Ser Gin Glu Ser Pro Ala Ala Leu Arg 
290 295 300 

^ys Gin Val Thr Ser Pro Gly Gly Ala Thr Leu Ala Gly Leu Lys Val 
::3 310 315 320 

Phe Glu Ser Ala Gly Val Arg Asp Ala Met Asp Ala Ala Val Arg Ala 
325 330 335 

"32 Mis Ala Arg Ala Gin Glu Leu Gly Arg Leu Pro Leu Ala Pro Leu 
340 345 350 



Glu Pro Gly Lys Glu Gly 
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<210> 20 
<211> 420 
<212> PRT 

<213> Angiococcus discif ormis 
<400> 20 

Val Pro Gly Val Ala Val Ala Val Val Ser His Gly Arg Val lie Val 
15 10 15 

Ala Lys Gly Tyr Gly Phe Ser Asn Leu Glu His Gin Val Pro Val Thr 
20 25 30 

Pro Asp Thr Leu Phe Gin Ser Gly Ser Leu Gly Lys Gin Phe Thr Ala 
35 40 45 

Met Ala Val Met Leu Gin Val Glu Ala Gly Arg lie Ser Leu Ser Asp 
50 55 60 

Ser He Thr Lys Tyr Phe Pro Asp Ala Pro Ser Thr Trp Ala Pro He 
65 ' 70 75 80 

Thr He Arg His Leu Leu Thr His Thr Ser Gly Leu Ser Asp Gly Leu 
85 90 95 

Leu Asp Leu Arg Lys Asp Tyr Thr Asp Glu Glu Phe Ala Gin Phe Leu 
100 105 HO 

, Tvr Thr Leu Pro Leu Asp Phe Pro Ala Gly Leu Arg Trp Asn Tyr Ser 
' 115 120 125 

Asn Ala Gly Tyr Val Leu Leu Gly Leu Leu Val Asn Arg Val Ala Gly 
130 " ' 135 140 

Thr Ser Tyr Val Asn Val Leu Gly Glu Gin Val Phe Lys Pro Thr Arg 
145 150 155 160 

Met Lys Thr Ala Arg Gly He Ser Glu Ala Asp He Val Pro Asn Arg 
165 170 175 

Ala Ala Gly Tyr Arg Leu Leu Asp Gly Val Val Lys Asn Gin Glu Trp 
180 185 190 
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Val Ser Pro Ser Leu Asn Thr Thr Ala Asp Gly Ser Leu Tyr Phe Ser 
195 200 205 

Leu Lys Asp Met Leu Ala Trp Asp Asp Ala Val Asp Asp Arg Ala lie 
210 215 220 

Leu Thr Asn Gly Ser Trp Arg Asp He Leu Ser Pro Val Lys Leu Ser 
225 230 235 240 

Ser Gly Ala Ser Tyr Pro Tyr Gly Phe Gly Trp Phe He Glu Glu Arg 
245 250 255 

Asn Gly Gin Leu Val His Gin His Ala Gly Ala Trp Gin Gly Phe Lys 
260 265 270 

Thr Ala Tyr Tyr Arg Phe Thr Gly Asp Ala Leu Ser He He Val Leu 
275 280 285 

Leu Asn Leu Ala Glu Pro Asn Pro Ala Ala He Ala Asn Gly He Ala 
290 295 300 

Ala He Val Asn Pro Ala Leu Ala Val Pro Pro Leu Ala Pro He Pro 
305 310 315 320 

Asp Leu Glu Pro Glu Val Thr Ala Arg Leu Ala Thr Leu Leu Glu Gin 
325 330 335 

Ala Arg Ala Gly Ala Leu Asn Pro Ala Glu Phe Ala Tyr Val Pro Gly 
340 345 350 

Trp Phe Phe Thr Glu Ala Ala Pro Tyr Tyr Gin Ser Leu Leu Gin Ser 
355 360 365 

Leu Gly Pro Ser Gly Pro Leu Val Leu Ala Lys Arg Glu Met Arg Gly 
370 375 380 

Asp Asp Arg Val Tyr Thr Tyr Leu Val Gin Ala Gly Ser Thr Met Leu 
385 390 395 400 

Arg Tyr Arg Val Ala Leu He Pro Asp Gly Arg Val Ser Ala Phe Ala 
405 410 415 

Leu S^xr Pro Asn 
420 



< 2 1 0 > 21 



WO 2004/022586 A B?BPCT/EP2003/009780 

203 

<211> 184 



<212> PRT 

<213> Angiococcus disciformis 
<400> 21 

Val Ser lie Arg Arg Thr Thr Ala Ser Thr Arg Asn Val Thr Val Ser 
15 10 15 



Ala Met Leu Asp Pro Ala Leu Gly Ser Ala Arg Asp Ala Leu Gin Val 
20 25 30 



Phe Leu Glu Lys Ala Leu Arg Ala Gin Phe Gly Ala Asp Trp Gin Gly 
35 40 45 



Thr Leu Glu Arg Asp Arg Leu Ala Gin Gly Arg Pro Thr Ser Glu Arg 
50 55 60 



Trp Asp Trp Gly Ser lie Ser Tyr Arg Leu Gly His Phe Arg Ser Pro 
65 70 75 80 



Gin Ala Thr Gly Ser Gly Val Val Thr Glu Pro lie Val Ala Leu Ala 
85 90 95 



Arg Leu Ala Lys Ala Tyr Arg Asn Leu Phe Gin His Glu Glu Leu Thr 
100 105 110 



Asp Ala Gin Val Arg His Ala Met Glu Gly Leu Ala Leu Leu Gin Glu 
115 120 125 



Ala Leu Gly Asn Lys Thr Arg Ala Ala Glu Thr Arg Ala Ala Leu Glu 
130 135 140 



Val Trp Ala Gly Pro Pro Ala Arg Ala Ala Lys Arg Pro Ala Arg Lys 
145 150 155 160 



Ala Pro Ala Ala Ala Lys Arg Ala Lys Pro Ala Ala Lys Lys Lys Ala 
165 170 175 



Pro Lys Ala Lys Ser Ala Arg Arg 
180 



<210> 22 
<211> 351 
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<212> PRT 

<213> Angiococcus disciformis 



<400> 22 

Met Thr Ser Gly Thr Arg Lys Thr Gly Gly Asp Gly Ala Arg Arg Asn 
15 10 15 

Asp Lys Tyr Arg He Leu Ser Leu Asp Gly Gly Gly Leu Arg Thr Leu 
20 25 3 0 



Leu Ser He Gly Leu Leu Lys Arg He Asp Gin Leu Arg Pro Gly Phe 
35 40 45 

Leu Asp Gin Val Asp Leu Val Ala Gly Thr Ser Ala Gly Ala He Ser 
50 55 60 

Ala Leu He He Ala Ala Ala Arg Glu Pro Ala Val Gly Leu Glu Gin 
65 70 75 80 

Ala Arg Gin He Trp Phe Thr Pro Gly Leu Phe Asp Ser Pro Leu Ser 
85 90 95 

Asn Gin Leu Gly Ala Leu Val Gly Gin Ser Ala Leu Met Pro Ser Glu 
100 105 HO 



Asn Met Ala Arg Ala Leu Thr His He Leu Gly Asp Lys Thr Leu Arg 
115 120 125 

Asp Leu Lys Arg Lys Val Val He Pro Ala Phe Gin Leu Asp Asp Gly 
130 135 140 

Asp Pro Asp Glu Asp Arg Arg Gly Trp Arg Pro Arg He Phe His Asn 
145 " 150 155 160 

Phe Pro Gly Asp Thr Phe Val Asn Leu Asp Asp Tyr Leu Val Asp Leu 
165 170 175 



Ala Leu Arg Ser Ser Ser Leu Pro He Val Ser Pro Val His Gin Gly 
180 185 190 

Tyr Val Asp Gly Gly Leu Phe Ala Asn Asn Pro Thr Met Ser Ala Val 
195 ~ 200 205 



Ala Gin Ala He Tyr Ala Lys Ala Ala Asp Val Arg Asp He Leu Val 
210 215 220 
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Phe Ser Leu Gly Thr Gly Asp Ser Val Asp Tyr Leu Asp Gly Tyr Asn 
225 230 235 240 

Glu Asn Trp Gly Trp Arg Lys Trp Leu Leu Asp Pro Lys Gin Pro Met 
245 250 255 

Ala Phe Val Ala Ala Thr lie Glu Ala Gly Val Glu Ala He Asp Phe 
260 265 270 

Gin Ala Lys Met Leu Leu Pro Arg Gly Asn Tyr Trp Arg Glu Asp Pro 
275 280 285 

Val Val Pro Phe Asn Leu Gly Asn Ser Val Glu Ala Gin He Ala Thr 
290 295 300 

Leu Asp Arg Val Val Lys Arg His Asp Leu Ser Lys Val Leu Glu Trp 
305 ^ ' 310 315 320 

Val Glu Asn Ser Gly Trp Gin Pro Lys Ala Thr Gin Glu Thr Ala Ser 
325 330 335 

Gly Ala Gin Arg Glu Phe Ser Gly Glu Ser Gly Glu Ala Arg Ser 
340 345 350 

<210> 23 
<211> 351 
<212> PRT 

<213> Angiococcus disciformis 



<400> 23 

Met Thr He lie His Gly Gin Pro Gly Ala Asp Gly Ala Arg Arg Asn 
15 10 15 

Gly Lys Tyr Arg He Leu Ser Leu Asp Gly Gly Gly Leu Arg Thr Leu 
20 25 30 



Leu Ser He Gly Leu Leu Lys Arg He Asp Gin Leu Arg Pro Gly Phe 
35 40 45 

Leu Asp Gin Val Asp Leu Val Ala Gly Thr Ser Ala Gly Ala He Ser 
50 55 60 



Ala Leu He He Ala Ala Ala Arg Glu Pro Ala Val Gly Leu Glu Gin 
65 70 75 80 
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Ala Arg Gin Leu Trp Leu Thr His Glu Leu Phe His Ser Ser Leu Arg 
85 90 95 



Thr Arg Leu Gly Ala lie lie Gly Met Asn Ala Leu Thr Pro Ser Lys 
100 105 110 



Asn Met Ala Arg Ala Leu Thr His lie Leu Gly Asp Lys Thr Leu Gly 
115 120 125 



Asp Leu Lys Arg Lys Val Val He Pro Ala Phe Lys Leu Asp Asp Glu 
130 ~ 135 140 

Asn Pro Asp Glu Asn Arg Arg Glu Trp Lys Pro Arg Val Phe His Asn 
145 150 155 160 

Phe Pro Gly Ala Asp Phe Val Asn Pro Gly Asp Arg Leu Val Asp Leu 
165 170 175 



Ala Leu Arg Ser Ser Ser Leu Pro He Val Ser Pro Val Tyr Gin Gly 
180 185 190 



Tyr Val Asp Gly Gly Leu Phe Ala Asn Asn Pro Thr Met Ala Ala Val 
195 200 205 



Ala Gin Ala He Tyr Ala Lys Ala Ala Asp Leu Arg Asp Leu Leu Val 
210 215 220 



Phe Ser Gin Gly Ala Gly Asp Met Val His Tyr Leu Lys Gly Tyr Asn 
225 230 235 240 



Glu Asn Trp Gly Trp Trp Lys Trp Leu Leu Asn Ser Lys Leu Pro Trp 
245 250 255 



Ala Leu Val Asp He Ala Val Ala Ala Gly Val Glu Ala He Asp Phe 
260 265 270 



Gin Ala Lys Arg Leu Leu Pro Arg Gly Asn Tyr Trp Arg Glu Asp Pro 
275 280 285 



Arg Val Pro Ser His Leu Gly Asn Ser Val Glu Ala Gin Val Val Thr 
290 295 300 

Asc Gin He Val Glu Lys His Asp Leu Ser Glu Ala Leu Glu Trp 
305 310 315 320 



Val Asp Ser Ser Gly Trp Leu pro Lys Gly Met His Glu Ala Ala Ser 
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325 330 335 

Gly Ala Gin Pro Glu Leu Ser Gly Glu Ser Gly Glu Ala Arg Ser 
340 345 350 

<210> 24 
<211> 405 
<212> PRT 

<213> Angiococcus discif onriis 



<400> 24 

Met Ala Ala Arg Cys Arg Gin Ala Arg Gly Ala Ser Met Thr Arg lie 
15 10 15 



Leu Leu Phe Ser Gly Lys Gly Gly Val Gly Lys Thr Thr Ala Ser Ala 
20 25 30 



Ala Thr Ala Val Ala Ala Ala Lys Arg Gly Tyr Arg Thr Leu Val Met 
35 40 45 



Ser Phe Asp lie Ala His Ser Leu Ser Asp Ser Phe Asp Leu Asp Arg 
50 55 60 



Lys Leu Phe Asp Phe Asn Glu Gly Leu Pro Gin Lys Val Ala Pro Asn 
65 70 75 80 



Leu Glu Leu Gin Glu He Asp He Gin His Glu Leu Gin Arg Gin Trp 
85 90 95 



Ser Glu Val Tyr Asn Tyr Met Ser Val Leu Leu Thr Ser Thr Gly Val 
100 105 110 



Ser Asn Met Val Ala Glu Glu Val Ala He Leu Pro Gly Thr Glu Asp 
115 120 125 



Val He Ser Leu He Tyr Leu Asn Gin Tyr Val Gin Gin Gly Arg Tyr 
130 135 140 



Asp Val He He Val Asp Cys Pro Pro Thr Gly Glu Ser Leu Arg Phe 
145 150 155 160 



Val Asn He Thr Ser Val Leu Gin Trp Tyr He Arg Arg Arg Phe Asn 
165 170 175 
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Val Asp Arg Thr Leu Val Lys Leu Ala Arg Pro Leu Ala Ser Arg Leu 
180 185 190 

Thr Ser Tyr Asp Leu Pro Glu Asp Ser Tyr Phe Ala Ser Leu Glu Gin 
195 200 205 

Leu Phe Asp Arg Val Lys Gly lie Glu Gly Leu Leu Thr Asp Gly Asn 
210 215 220 

Arg Thr Thr Val Arg Leu Val Ser Ser Ala Glu Lys Met Val He Arg 
225 230 235 240 

Glu Thr Gin Arg Ala Tyr Leu Tyr Phe Asn Met Tyr Gly Met Thr Val 
245 250 255 

Asp Gin Val Val Val Asn Arg Leu Leu Pro Asp Thr Glu Tyr Phe Ala 
260 265 270 

Gin Trp His Lys Ser Gin Ser Ala Tyr Leu Asp His He Gin Glu Tyr 
275 280 285 

Phe Ser Pro Met Pro Val Ala Arg Leu Pro Leu Leu Glu His Glu Val 
290 295 300 

Val Gly Leu Glu Arg Leu Glu Asp Leu Ala Gin Arg Leu Tyr Gly Asp 
305 310 315 320 

Ser Asp Pro Thr Glu Arg Tyr He Ser Gly Gin Pro Tyr Gin Phe Ala 
325 330 335 

Lys Gin His Glu Gly Arg Tyr Arg Leu Gin Leu Leu Met Pro Gly Ala 
340 345 350 

Asp Arg Gly Asp lie Val Leu Asp Arg Gin Gly Asn Asp Leu He He 
355 360 365 

Arg Val Gly Gly Phe Arg Arg His Val Met Leu Pro Arg Ser Val Ala 
370 375 380 

Gin Leu Asp Ala Val Asp Ala Val Leu Asp Lys Gly Met Leu Lys He 
385 390 395 400 

Asp Phe Ala Arg Pro 
405 



25 



# 
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<211> 373 
<212> PRT 

<213> Angiococcus disciformis 



<400> 25 

Val Asn Ala Val Arg Pro Val Met Gin Leu Lys Gly Ala Gly Ser Ala 
15 10 15 

Met Arg Met Thr Arg He Leu Gly Glu Gin Glu Leu Arg Gin He Val 
20 25 30 

Arg Ala Thr Gly Leu His Ala Leu Met Asp Glu Val He Ser Thr Leu 
35 ~ 40 45 

Glu Tyr Glu Leu Glu His Phe Asp Pro Ala Asn Thr Glu Leu Arg Lys 
50 55 60 

Arg Glu Gly Phe Gin Tyr Gin His Pro His Pro Gly Val Leu Glu Trp 
65 70 75 80 

Met Pro Val Met Gin Val Gly Arg His Ala Val He Lys Leu Val Gly 
85 90 95 

Tyr Asn Pro Arg Ser Pro Glu Glu Leu Gly Leu Pro Thr He Leu Ser 
100 105 HO 

Ser Leu Ser Leu Tyr Asp Val Thr Thr Gly His Leu Arg Ala Val Cys 
115 120 125 

Asp Gly Thr Phe Pro Thr Ala Leu Arg Thr Gly Ala Ala Ser Ala Val 
130 135 140 

Ala Ser Arg Leu Leu Ala Arg Glu Asp Ser Arg Val Leu Gly Leu Val 
145 150 155 160 

Glv Cys Gly Ala Gin Ala Val Thr Gin Ala His Ala Leu Ser Arg Val 
165 170 175 

Phe Pro Leu Gly Arg Val Leu Val His Asp He Ala Pro Glu Ala Met 
180 185 190 

Arg Ser Phe Ala Arg Arg Val Ala Phe Leu Gly Leu Glu Val Val Ala 
195 200 205 



Val Pro Leu Thr Arg Val Glu Gin Glu Ala Asp He Leu Cys Thr Ala 
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210 215 220 

Thr Ser Val Ala Val Gly Ala Gly Pro Val Val Ser Gly Arg Leu lie 
225 230 235 240 

Lys Ala His Leu His He Asn Ala Val Gly Ser Asp Leu Pro Gly Lys 
245 250 255 

Thr Glu Leu Pro Leu Glu Met Leu Arg Arg Ala Leu Val Cys Pro Asp 
260 265 270 

Tyr Leu Pro Gin Ala Val Val Glu Gly Glu Cys Gin Gin Leu Glu Pro 
275 280 285 

Gly Glu He Gly Pro Ser Leu Phe Glu Leu Thr Arg Ser Pro Glu Gin 
290 295 300 

Trp Arg Ser His Arg Asp Gly Leu Thr Val Phe Asp Ser Thr Gly Phe 
305 310 315 320 

Ser Leu Glu Asp Lys Val Val Leu Glu Val He Leu Ala His Ala Glu 
325 330 335 

Arg Leu Gly Val Gly Arg Leu Leu Gin Val Glu Ser Met Ala Glu Asp 
340 345 350 

Pro Lys Asp Pro Tyr Gly Phe Leu Gin Glu Thr Ala Thr Cys Leu Lys 
355 360 365 

Ala Ala Pro Ala Val 
370 

<210> 26 

<211> 219 

<212> PRT 

<213> Angiococcus disciformis 



<400> 26 

Met Arg He Leu Thr Asp Val Phe Leu Arg Lys Tyr Gin Arg Ala Pro 
1 5 10 15 

Gly Pro Arg Glu Arg Ser Leu Ala Ala Asp Ala Arg Ser Lys Leu Trp 
20 . 25 30 
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Leu Ser Val Gly Leu Thr Ala Phe Leu Val Pro Ala Tyr lie Glu Tyr 
35 40 45 



Phe Cys Arg Met Leu Gly Leu Pro Lys Gly Glu Leu Pro Trp Lys Leu 
50 55 60 



Tyr Asp Met Leu Arg Glu Arg Asn Glu Ala Pro Asp Leu Lys Gin Arg 
65 70 75 80 



Phe Gin Gin Leu Glu His Leu Pro Glu Gly Ser Phe Gly Arg Glu Phe 
85 90 95 



Trp Lys Tyr Cys His Ser His Gly Val Pro Leu Pro Gly His Pro His 
100 105 110 



Ala Leu Pro Val His Ser Ala Val His Asp Phe Val His Leu Leu Ser 
115 120 125 



Gly Tyr Gly lie Ser lie Trp Glu Glu Met Leu Thr Thr Ala Phe Ser 
130 135 140 



Leu Gly Phe Met Phe Gly Pro Arg Thr Arg Asn Tyr Asp Pro Ala Gin 
145 150 155 160 



Tyr Lys Ser Leu Val Pro Val Arg Leu Leu Ala Glu Ala Leu Pro lie 
165 170 175 



Pro Gin Leu Arg Arg Ala Val Glu Arg Gly Ser Ala Met Ser Val Asp 
180 185 190 



Leu Phe Gly Asp Trp Asp Pro Trp Lys Val Met Glu Leu Pro Leu Glu 
195 200 205 



Glu Val Lys Arg Met Tyr Asn lie Gin Pro Glu 
210 215 



<210> 27 

<211> 781 

<212> PRT 

<213> Angiococcus discif ormis 

<400> 27 



Met Gly His Glu Asp Ser Asp Lys Ala Leu Pro Asp Ser Lys Ala Asn 
15 10 15 
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Glu Val Glu Ser Ser Arg Asn Glu Ala Glu Trp Asn Glu Leu Pro He 
20 " 25 30 

Ser Ala Arg Asp Cys Glu Asp Leu Lys Lys Ser Cys Leu Leu Pro Ser 
35 40 45 

Thr Val Arg Lys Leu Gly Trp Arg Thr Asp Thr Asp Glu Asn Asn Arg 
50 55 60 

Lys Leu Leu Asn Trp Pro Glu Glu Trp Lys Gly Asp Ala Gly Gly Leu 
65 70 75 80 

Val Leu Val Tyr Pro Gly Leu Asn Tyr Ala Lys Val Lys Pro His Arg 
85 90 95 

Pro Arg Val Leu Arg Pro Lys Glu Ala Gin Leu Thr Leu Pro Thr Ala 
100 105 110 

Thr Phe Thr Asp His He Leu Arg Pro Tyr Thr Gin Asp Glu Val Glu 
115 ** 120 125 

Gly Glu Pro Ala Lys Val Lys Lys Tyr Glu Ala Pro Phe Gly Tyr Ala 
130 135 140 

Pro Gin He Phe Asp Thr Ala He Leu Cys Pro Met Leu Gin Asp Glu 
145 150 155 160 

j 

Glu Val Pro Met Ala Leu He Glu Gly Glu Lys Lys Ala Ala Leu Leu 
165 170 175 

Ala Gin Glu Gly Thr Pro Ala Leu Gly Leu Pro Gly He Ser Met Ala 
180 185 190 

His Asp Val Glu Ala Arg Arg Arg Ser Trp Asp Asp Leu Thr Asp Glu 
195 200 205 

Trp Val Pro His Pro Glu Ala Ala Ser Tyr Leu Ala Gin Gly Arg Thr 
210 215 220 

Val Phe Val Cys Gly Asp Ala Pro Asp Met Asp Gly Thr Asn Met Asn 
225 230 235 240 

v/ai Vai Arg Glu Val Val Arg He Ala Lys Gly Leu Ser Ala Arg Gly 
245 250 255 



. y » r£o Gin He Pro Tyr Val Asp His Asp Pro Gin His Glu Lys 

260 265 270 
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Ser Gly Val Asp Asp Trp Val His Arg Glu Met Lys Lys Leu Leu Ala 
275 280 285 



Ala His His His Lys Thr Glu Ala Trp Ala Arg Gin Phe Leu lie Val 
290 295 300 



Gin Tyr Leu Ala Ser Ala Arg Pro Ala Leu Pro Ser Ala Leu Cys Asp 
305 310 315 320 



Trp Leu Val Glu Gin His Glu Ala His Gly Gly Val Lys Glu lie Glu 
325 330 335 



He Lys Arg Ala Leu Val Trp Cys Ser Val Trp Phe Ser Thr Glu His 
340 345 350 



Gly He Phe Lys Lys Trp Val Gly Thr Leu Lys Arg Lys Lys Val Cys 
355 360 365 



Lys Pro Glu Val He Asp Arg Leu Val Gin Ala Leu Pro Arg Val Glu 
370 375 380 



Asn Arg Phe Glu Pro Arg Thr Trp Val Gin His Trp Met Glu Lys Asn 
385 390 395 400 



Gly Val Thr Phe Val Tyr Ala Ser Asp Gly Phe Gin He Gly Gly Arg 
405 410 415 



Met Phe Pro Ala Ser Glu Phe Leu Ser Gin He Glu He Glu Ser Thr 
420 425 430 



Asn Glu Arg Thr Cys He Ser Val Arg Asn Leu Gly Asn Ala Val Arg 
435 440 445 



Leu Trp Glu Ala Asp Gin Arg Arg Gly He Val Glu Gin Tyr Arg Glu 
450 455 460 



Lys Leu Lys Phe Arg Glu Asp Gly Arg Arg Gly Asp Ala Glu Glu Phe 
465 470 475 480 



He Arg Ala Thr Thr Gly Arg Leu Asp Pro Val Asp Leu Val Val Val 
485 490 495 



Leu His Phe He Trp Gin Val Lys Arg Lys Leu Phe Gly Lys Pro Val 
500 505 510 



Lys Asp His Leu Met Pro He Phe Val Gly Lys Gin Arg Gly Gly Lys 
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515 520 525 

Thr Thr Ala lie Lys Ser Leu Leu Gin Pro Leu Gly Asp Leu Val Gly 
530 535 540 

Phe Pro Ala Asp Leu Gin Phe Leu Thr Asp Glu Arg Gin Gin Phe Arg 
545 550 555 560 

Leu Thr Arg Cys Tyr Gly Met Phe Phe Asp Glu Met Ala Lys Ala Glu 
565 570 575 

Arg Val Ala Ser Asp Val Leu Lys Asn Arg lie Thr Ala Glu Arg lie 
580 585 590 

Glu Tyr Arg Val Leu Gly Val Gin Lys Leu Gin Thr Gly Thr Asn Asn 
595 600 605 

Ala Thr Phe lie Gly Ala Ser Asn Lys Gin Val Lys Glu lie Val Val 
610 615 620 

Asp Pro Thr Gly Met Arg Arg Phe Tyr Gin Leu Asp Cys Leu Pro Gin 
625 630 635 640 

Met Asn Trp Thr Ala Leu Glu Gin lie Asp Phe Leu Ala Met Trp Thr 
645 650 655 

Ser Val Asp Glu Lys Ala Glu Ser Pro Leu Leu Gly Gin Leu Ala Gin 
660 665 670 

Val Gin Ala Arg Gin Asp Glu lie Arg Ala Lys Asp Ser Val Glu Asp 
675 680 685 

Phe Phe Glu Arg Arg Cys Ala Gin Arg Asp Gin Trp lie Lys Ala Thr 
690 695 700 

Ala Leu Tyr Gly His Tyr Val Glu Asp Leu Lys Tyr Gin Gly Arg Asp 
705 710 715 720 

Aro Met Ala Phe Ser Leu Thr Lys Phe Gly Glu Arg Met Lys Glu Leu 
725 730 735 

Ala Gly Glu Glu Asn Thr Gly Trp Lys Arg Ser Asp Gly lie Lys Tyr 
740 745 750 



Arg Val Thr Val Lys Arg Trp Glu Thr Glu lie Asp Gly Val Met Glu 
755 ~ 760 765 
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Ser lie Ala Ala Gly Ala Arg Met Asp Leu Ala His Pro 
770 775 780 

<210> 28 

<211> 332 

<212> PRT 

<213> Angiococcus disciformis 



<400> 28 

Met Asp Thr Asp Lys Val Ala Leu Phe Gly Ala Ser Gly Pro lie Gly 
15 10 15 



His Ser He Val Lys Ala Leu Arg Glu Gin Gly Arg Ala Tyr Arg Val 
20 25 30 



Val Gly Arg Ser Arg Ala Ser Leu Gin Ser Gin Phe Gly Ala Asp Pro 
35 40 45 



Lys Ala Glu He Ala Thr Trp Asn Pro Asp Ala Pro Asp Thr lie Arg 
50 55 60 



Ala Ala Ala Arg Gly lie Arg Thr Leu He Tyr Met Val Gly Val Asp 
65 70 75 80 



Tyr Trp Arg Phe Gin Leu His Pro Glu Leu Met Arg Arg Thr Val Asp 
85 90 95 



Ala Ala He Ala Glu Gly Val Glu Arg Val Val Leu He Gly Thr Val 
100 105 110 



Tyr Pro Tyr Gly Arg Pro Arg Thr Thr Pro Val Thr Glu Asp His Pro 
115 120 125 



Arg Glu Pro His Thr Phe Lys Gly Arg Met Arg Lys Glu Gin Glu Asp 
130 135 140 



He Leu Leu Ala Ala Asp Ala Ala Gly Lys Leu Arg Ala Thr He Leu 
145 150 155 160 



Arg Leu Pro Asp Phe Tyr Gly Pro Gly He Asp Lys Ser Phe Leu His 
165 170 175 



Ser Ala Phe Val Ala Ala Ala Gin Gly Lys Arg Ala Gin Leu He Gly 
180 185 190 
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Pro lie Asp Pro Pro His Glu Tyr Val Tyr Val Pro Asp Val Gly Pro 
195 200 205 



Thr Val Thr Ala Leu Met Asp Ala Pro Gly Ala Tyr Gly Arg Trp Trp 
210 215 220 



Asn Leu Ala Gly Ala Gly Val Thr Ser Gin Arg Lys Leu Val Glu Glu 
225 230 235 240 



lie Tyr Ala Gin Ala Gly His Leu Ala Lys Phe Met Thr Ala Gly Lys 
245 250 255 



Gly Met Leu Arg Leu lie Gly Leu Phe Asn Pro Phe Met Arg Glu Met 
260 265 270 



Val Glu Met His Tyr Leu Leu Thr Glu Pro Val Leu Met Asp Asp Ser 
275 280 285 



Ala Leu His Lys Leu Leu Gly Ser Val Arg Lys Thr Pro Tyr Ser Glu 
290 295 300 



Gly lie Arg Gin Thr Leu Glu Ala Thr Arg Ala Ser Leu Val Ser Pro 
305 310 315 320 



Ala Pro Ala Ala Thr Gly Arg Pro His Pro Ala Pro 
325 330 



<210> 29 
<211> 547 
<212> PRT 

<213> Angiococcus discif ormis 
<400> 29 

Met Ala Ala Lys Glu He Phe Phe His Gin Ser Ala Arg Glu Ala He 
15 10 15 



Leu Arg Gly Val Arg Thr Leu Ala Asp Ala Val Ala Val Thr Leu Gly 
20 25 30 



Pro Lys Gly Arg Asn Val Val He Glu Lys Ser Phe Gly Ser Pro Thr 
35 40 45 



He Thr Lys Asp Gly Val Thr Val Ala Lys Glu He Asp Leu Asp Asn 
50 55 60 
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Lys Phe Glu Asn Met Gly Ala Gin Met Val Lys Glu Val Ala Ser Lys 
65 70 75 80 

Thr Ser Asp Lys Ala Gly Asp Gly Thr Thr Thr Ala Thr Val Leu Ala 
85 90 95 

Arg Ala He Tyr Glu Glu Gly Leu Lys Leu Val Ala Ala Gly His Ser 
100 105 11° 

Pro Met Asp Leu Lys Arg Gly He Asp Lys Ala Val Glu Val Val Val 
115 " 120 125 

Glu Glu Leu Lys Lys Leu Ser Lys Pro Thr Ala Asp Lys Lys Ala He 
130 135 140 

Thr Gin Val Gly Thr He Ser Ala Asn Gly Asp Glu Thr He Gly Ser 
145 ~ 150 155 160 

He He Ala Asp Ala Met Glu Lys Val Gly Lys Glu Gly Val He Thr 
165 170 175 

Val Glu Glu Ala Lys Gly Leu Glu Thr Asn Leu Asp Val Val Glu Gly 
180 185 190 

Met Gin Phe Asp Arg Gly Tyr Val Ser Pro Tyr Phe Val Thr Asn Arg 
195 200 205 

Glu Arg Met Glu Val Val Met Asp Asp Pro Phe He Leu He Ser Glu 
210 215 220 

Lys Lys Val Ser Ser Met Gin Asp Met He Pro He Leu Glu Gin Val 
225 230 235 240 

Ala Arg Ser Gly Lys Pro Leu Leu He He Ala Asp Asp He Glu Gly 
245 250 255 

Glu Ala Leu Ala Thr Leu Val Val Asn Lys He Arg Gly Val Leu Asn 
260 265 270 

Val Ala Ala Val Lys Ala Pro Gly Phe Gly Asp Arg Arg Lys Glu Met 
275 280 285 

Leu Lys Asp He Ala Thr Leu Thr Gly Gly Met Val Val Ser Glu Glu 
290 " 295 300 

Leu Gly His Lys Tyr Glu Asn Leu Thr Leu Asn Asp Leu Gly Arg Ser 
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305 310 " 315 320 

Lvs Arg He Thr Val Asp Lys Asp Asn Thr Thr He Val Asp Gly Ala 
325 330 335 

Gly Thr Lvs Ser Glu He Glu Gly Arg He Lys Leu He Arg Ser Gin 
340 345 350 

He Asp Thr Val Thr Ser Asp Tyr Asp Arg Glu Lys Leu Gin Glu Arg 
355 360 365 

Leu Ala Lys Leu Val Gly Gly Val Ala Val He Asn Val Gly Ala Ala 
370 375 380 

Thr Glu Thr Glu Met Lys Glu Lys Lys Ala Arg Val Glu Asp Ala Leu 
385 390 395 400 

His Ala Thr Arg Ala Ala Val Glu Glu Gly He Val Pro Gly Gly Gly 
405 410 415 

Val Ala Phe Leu Arg Cys Leu Pro Ala Leu Glu Lys Leu Lys Leu Gly 
420 425 430 

Gly Glu Gin Asp Phe Gly Val Glu He He Arg Arg Ala Leu Gin Glu 
435 440 445 

Pro Leu Arg Lys He Ala Ser Asn Ala Gly Val Glu Gly Ala Val Val 
450 455 460 

He Asn Lys Val Arg Glu Gly Gin Gly Ala Phe Gly Tyr Asn Ala Arg 
465 470 475 480 

Thr Glu Val Tyr Glu Asp Leu Glu Lys Ala Gly Val He Asp Pro Thr 
485 490 495 

Lys Val Glu Arg Thr Ala Leu Gin Asn Ala Ala Ser Val Ala Ser Leu 
500 505 510 

Leu Leu Thr Thr Glu Ala Met He Ala Asp Arg Pro Ala Lys Lys Lys 
515 520 525 

Gly Lys Asn Gly Gly Gly Ala Gly Met Pro Asp Tyr Gly Gly Asp Asp 
tzpn 535 540 



Met; Asp Tyr 
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<210> 30 
<211> 746 
<212> PRT 

<213> Angiococcus disciformis 
<400> 30 

Val Thr Arg Arg Cys Arg Cys Pro Ser Cys Pro Pro Arg Ala Gly Gly 
! 5 10 15 

Ala Cys Ser Val Ala Trp Pro Ala Pro Tyr Arg Ala Arg Pro Arg Ala 
20 25 30 

Pro His Ser Phe Arg His Pro Pro Cys Pro Arg Arg Pro Leu Pro His 
35 40 45 

Pro Cys Pro His Arg Ser Arg Ser His His Pro Ala Pro Cys Leu Arg 
50 55 60 

Pro Pro Ser Gly Pro Arg Pro Leu Arg Pro Ser Arg Ser Pro Pro Ser 
65 70 75 80 

Arg Pro Arg Pro Ala Ala Pro Ser Ala Gly Ala Cys Pro Arg Asn Arg 
85 90 95 

Pro Pro Arg Arg Ala Gly Arg Thr Gly Ala Arg Val Ser Trp Asp Leu 
100 105 110 

Ser Leu Glu Gly Ala Ala Gin Leu Thr Ala Glu Arg Arg Gly He Gly 
115 120 125 

Gin Gin Leu Arg Lys Ala Arg Gly Gly Thr Gly Ala Gin Arg Leu Gin 
130 135 140 

Arg Arg Leu His Gly Glu Val Leu Gly Gin His Asp Glu Gly Gly Ala 
145 ~ 150 155 160 

Leu Gly Gin Arg Leu Gly Gin Leu Gin Arg Leu Ala Pro He Gly Arg 
165 170 175 

Arg Glu Val Pro Ala His His His Ala His Arg Arg Arg His Arg His 
180 185 190 



Gin Leu Arg Arg Gly Gin His Pro Arg Leu Gin Pro Gin His Val Gin 
195 200 205 
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Gly Leu Gly Asp Glu Pro Arg Arg Arg Arg Val Val Leu Gin His Val 
210 215 220 

His Pro Pro His Phe Gly Pro Gly Gly His His Arg Arg His Arg Arg 
225 230 235 240 

Leu Arg Gly Gin Arg Arg Ala Gin Gin Pro Ala His Leu Pro Asp Val 
245 250 255 

Val Glu Gly Leu Glu Gin Val Val His His Ala Gin Leu Gin Arg Leu 
260 265 270 

Leu Arg Arg His Gin Arg Arg Val Ala Arg His Gin Asp Asp Ala Arg 
275 280 285 

Val Gly Val Glu Pro Pro Arg Leu Leu Arg Gin Leu Gin Pro Gly Gin 
290 295 300 

Ala Arg Gin Val Leu Val Arg Asp Asp Glu Val His Trp Arg Leu Ala 
305 ^ 310 315 320 

Gin Gin lie Gin Arg Phe Leu Ala Ala Gly Arg Leu Asp Asp Leu Ala 
325 330 335 

Leu Leu Ala His Gin Val Ala Glu Asp His Ala Asp Asp Gly Leu Val 
340 345 350 

Val His His Gin Gin Pro Leu Ala Gin Leu Arg Arg Arg Val Arg Gly 
355 360 365 

Gly Glu Glu Tyr Pro Glu His Gly Gly Arg Arg Gly His Val Ala Asn 
370 375 380 

Gly Ala Arg Arg Arg Glu Ala Asn Arg Arg Val Val Leu Leu Gly Asp 
385 390 395 400 

Ala Leu Ala His Arg Gin Pro Gin Ser Arg Ala Ala Leu Val Gly Gly 
405 410 415 

Aso Val Gly Leu Glu Asp Ala Leu Ala Gin Leu His Gly Asp Ala Arg 
420 425 430 

Ala Pro Val Arg His Val Gin His Arg Arg Gly Ala Leu Ala Arg Glu 
435 440 445 



Kis his Leu His Ala Pro Ala Ala Gly His Gly Val Ala Arg Val Val 
450 455 460 
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Gin Gin Val Gin Arg Glu Leu Leu Asp Glu Pro Arg Val Ala Leu Asp 
465 470 475 480 



Gly Asp Val Leu Gly Arg Leu His Leu Gin Arg His Ala Arg Arg Val 
485 490 495 



Arg Val Asp Ala Glu Gin Leu Arg Arg Arg Pro Asp Gly Arg Arg Gin 
500 505 510 



Arg Gin Ala Leu Leu Arg Arg Val Pro Leu Ala Gly Glu Ala Asp Val 
515 520 525 



Val Gly Asp Asp Leu Phe His Pro Leu His Leu Arg Gin Gin Gin Pro 
530 535 540 



Gin Arg Ala Gly Gly Ala Thr Val Leu Arg Glu Gin Leu Arg Val Gly 
545 550 555 560 



Leu His Ala Gin Gin Gly Leu Pro Gin Leu Val Gin His Leu Arg Arg 
565 570 575 



Gin Leu Cys Arg Val Arg Ala Arg Thr Leu Gin Ser Arg Arg Arg Arg 
580 585 590 



Ala Arg Gly Arg Gin lie Ser Arg Leu Gin Gly Leu Glu Gin Leu Gly 
595 600 605 



Glu Arg Arg Gly Cys Phe His Gly Pro Ser Met Ala Glu Ala Ala Val 
610 615 620 



Ser Thr Gin Arg Gly Arg Arg Asp Ser Val Asp Pro Gly Pro Trp Pro 
625 630 635 640 



Arg Gly Cys Ser Ser Arg Pro Leu Ala Pro Arg Pro Glu Arg Gly Arg 
645 650 655 



His His Arg Pro Arg Glu Gly Leu His Ala Ala Asp Gin Pro Val Pro 
660 665 670 



Leu Ala Gly Ala Pro Glu Asp Asp Arg Leu Arg Ala Gly Pro Leu His 
675 680 685 



Pro Ala Glu His Arg Gly Gly Ala Gly Arg His Gin Pro Leu Arg Arg 
690 695 700 



Pro His Arg Cys His Arg Gin Arg Gly Gly Gin Gly His Pro Pro Ser 
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705 710 715 720 

Arg His Gly Arg Gly Ala Arg Ala Arg Arg Pro Leu Leu Ala Ala Asp 
725 730 735 

Cys Pro Val Gly Pro Arg Ala Gly Pro Ala 
740 745 



<210> 31 
<211> 417 
<212> PRT 

<213> Angiococcus disciformis 
<400> 31 

Val Thr Ala Val Asp Asn Gly Leu Asp Ala lie Ser Arg Thr Arg Glu 
15 10 15 

Leu Arg Pro Asp Val Val Leu Ala Asp Val Met Met Pro Gly Lys Ser 
20 25 30 

Gly Tyr Glu Val Cys Glu Ala Leu Lys Ser Asp Pro Ala Thr Gin Gly 
35 40 45 

He Pro Val Val Leu Leu Ala Gly Thr Phe Glu Ala Phe Asp Glu Asn 
50 55 60 

Arg Ala Arg Ala Ala Arg Ala Asp Asp His He Ala Lys Pro Phe Glu 
65 70 75 80 

Ser Gin Val Leu Leu Asp Lys Val Lys Ala Leu Val Gly Gin Lys Ser 
85 90 95 

Asn Thr Met Pro Ala Ser Ala Ala Thr Gin Val Arg Pro Pro Thr Ala 
100 105 HO 

Gin Pro Ala Thr Pro Ala Pro Val Pro Pro Gly Pro Gin Pro Gly Ala 
115 120 125 

Pro Thr Ala Ala Arg Pro Gly He Pro Gin Gly Thr Gin Pro Gly Val 
130 135 140 

Pro Arg Pro Pro Gly Pro Gly Met Pro Pro Gly Ala Pro Gly Ala Pro 
145 " 150 155 160 
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Arg Pro Pro Gly Pro Gly Met Pro Pro Gly Pro Gly Met Ala Arg Pro 
165 170 175 

Pro Gly Pro Gly Met Pro Pro Pro Gly Ala Pro Gly Ala Pro Arg Pro 
180 185 190 

Pro Gly Pro Gly Met Pro Pro Gly Pro Gly Met Ala Arg Pro Pro Gly 
195 200 205 

Pro Gly Met Pro Pro Pro Gly Ala Pro Gly Ala Pro Arg Pro Pro Gly 
210 215 220 

Pro Gly Met Pro Pro Pro Gly Ala Ala Ala Arg Pro Gly Arg Ala Ala 
225 230 235 240 

Ser Ala Gly Cys Val Arg Ala Gly His Ala Ala Ala Ser Gly Arg Ala 
245 250 255 

Arg Ala Ser Gly Trp Ala Gly Ala Gly Met Ala Arg Pro Gly He Pro 
260 265 270 

Gin Gly Ser Gin Pro Gly Phe Pro Arg Pro Pro Gly Gly Ala Ala Pro 
275 280 285 

Leu Pro Pro Ala Pro Gly Gly Ala Thr Gin Pro Ala Ala Arg Ala Arg 
290 295 300 

Asp Pro Phe Gly Leu Gly Ala Pro Thr Ala Ala Pro Gin Ala Ala Gin 
305 ~ 310 315 320 

Pro Ser He Ser He Glu Asp Ser Leu Pro Glu Pro Ser Gly Ala Glu 
325 330 335 

Glu He Ser Leu Asp Met Gly Gly Pro Pro Thr Ala Ala Pro Thr Ala 
340 345 350 

Val Ala Pro Thr Ala Ala Pro Ala Thr Ala Arg Pro Thr Ala Ala Asp 
355 360 365 

Gly Gly Glu Ala Leu Leu Arg Glu Ala Leu Ser Lys Ala Ser Arg Glu 
370 375 380 

Val He Glu Lys He Ala Trp Glu Val Val Pro Gin Leu Ala Glu Thr 
385 ~ 390 395 400 



He He Arg Glu Glu Leu Glu Arg Leu He Lys Asp Arg Glu Thr Gin 
405 410 415 
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<210> 32 
<211> 1102 
<212> PRT 

<213> Angiococcus disciformis 
<400> 32 

Met Glu Arg Asn Tyr Phe Arg Ala Glu Ala Thr Ser Asp Lys Pro Ala 
15 10 15 

Phe Ser lie Val Leu Pro Pro Pro Asn Val Thr Gly Ser Leu His He 
20 25 30 

Gly His Ala Leu Thr Ala Thr He Gin Asp He Leu Ala Arg Trp Lys 
35 40 45 

Arg Met Ser Gly Phe Asn Thr Leu Trp Leu Pro Gly Thr Asp His Ala 
50 55 60 

G'y He Ala Thr Gin Met Val Val Glu Lys Glu Leu Lys Lys Thr Glu 
65 70 75 80 

Gly Lys Ser Arg His Asp Leu Gly Arg Ala Ala Phe Leu Glu Arg Val 
85 90 95 

Trp Glu Trp Lys Gly Lys Tyr Gly Ala Arg He Gly Glu Gin His Arg 
100 " 105 110 



Phe Leu Gly Ala Ser Leu Asp Trp Ser Arg Glu Arg Phe Thr Met Asp 
115 120 125 

Glu Gin Ser Ser Ala Ala Val Arg Glu Val Phe Val Arg Leu Tyr Glu 
130 135 140 

Glu Glv Leu Met Tyr Arg Ala Gin Lys Leu He Asn Trp Cys Pro Ser 
145 150 155 160 

~--s *rg Thr Ala Leu Ser Asp Leu Glu Val Glu His Glu Glu Ser Lys 
165 " 170 175 



Lys Li'-* ser He Trp His He Arg Tyr Pro Val Lys Asp Ser Asp Arg 
180 ^ 185 190 
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Thr Leu Thr Val Ala Thr Thr Arg Pro Glu Thr Met Leu Gly Asp Thr 
195 200 205 

Ala Val Ala lie His Pro Glu Asp Glu Arg Tyr Leu Gly Leu Ala Gly 
210 215 220 

Lys Ser Val Lys Leu Pro Leu Thr Asp Arg Glu lie Pro He He Ala 
225 230 235 240 

Asp Ala Glu Leu Val Asp Pro Lys Phe Gly Thr Gly Val Val Lys Val 
245 250 255 

Thr Pro Ala His Asp Phe Asn Asp Tyr Gin Thr Gly Leu Arg His Lys 
260 265 270 

Leu Pro Met Leu Thr He Leu Asp Glu Ala Ala Arg Met Thr Lys Glu 
275 280 285 

Thr Gly Lys Tyr Ala Gly Met Asp Arg Phe Glu Ala Arg Lys Gin Val 
290 295 300 

Leu Ala Asp Leu Thr Glu Gin Gly Leu Leu Glu Lys Glu Glu Pro His 
305 310 315 320 

Met Leu Asn Val Gly Thr Cys Gin Arg Cys Ala Thr Val Val Glu Pro 
325 330 335 

Arg Leu Ser Pro Gin Trp Phe He Lys He Glu Pro Leu Ala Lys Pro 
340 345 350 

Ala He Glu Ala Val Glu Gin Gly Arg Thr Lys Phe Val Pro Glu Ser 
355 360 365 

Trp Thr Asn Thr Phe Phe His Trp Met Arg Asn He His Asp Trp Cys 
370 375 380 

Val Ser Arg Gin Leu Trp Trp Gly His Gin He Pro Ala Tyr Tyr Cys 
385 390 395 400 

Thr Ser Cys Ser Pro Arg Gin Gly Asp Asp Thr Asp Leu Pro Leu Asp 
405 410 415 

Ala Pro Thr Val Lys Val Gly Gly Val Asp Phe Ala Arg Ala Glu Pro 
420 425 430 



He Val Ala Arg Glu Gin Pro Lys Ser Cys Pro Lys Cys Gly Gly Ala 
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435 . 440 4 45 

Ser Phe He Gin Asp Pro Asp Val Leu Asp Thr Trp Phe Ser Ser Gly 
450 455 460 

Leu Trp Pro Phe Ser Thr Leu Gly Trp Pro Arg Glu Thr Pro Glu Leu 
465 470 475 480 

Lys Thr Phe Tyr Pro Thr Ser Val Met Glu Thr Gly His Asp He He 
485 490 495 

Phe Phe Trp Val Ala Arg Met Met Met Met Gly He His Phe Met Gly 
500 505 510 

Asp Val Pro Phe Arg Thr Val Tyr Leu His Ala Met Val Arg Asp Glu 
515 52 0 525 

Lys Gly Gin Lys Met Ser Lys Thr Lys Gly Asn Val He Asp Pro Leu 
530 535 540 

Asp He Val Leu Gly Ala Pro Ala Asp Lys Leu Gin Pro Thr Leu Lys 
545 550 555 560 

Asn Arg Phe Pro Gin Gly Met Pro Pro Phe Gly Ala Asp Ala Leu Arg 
565 570 575 

Phe Thr Leu Ala Ser Leu Thr Gin Gin Gly Arg Asp He Lys Leu Ser 
580 585 590 

Met Asp Arg Leu Glu Gly Tyr Lys Ala Phe Gly Asn Lys Leu Trp Asn 
595 600 605 

Ala Ser Arg Phe Ala Leu Met Asn Met Gly Asp Phe Arg Met Glu Ala 
610 615 620 

Gly Pro Leu Asp Arg Ser Thr Leu Thr Leu Ala Asp Arg Trp He Leu 
625 630 635 640 

Ser ^rg Leu Gin Arg Ala Thr Thr Glu Ala Arg Ala Leu Leu Glu Ser 
645 650 655 

Tyr Ser Phe Gly Glu Ala Ala Ser Thr Leu Tyr Gin Phe Leu Trp Ala 
660 665 670 



Glu Phe Cys Asp Trp Tyr He Glu Leu Ala Lys Gly Ser Leu Tyr Gly 
675 680 685 
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Glu Asp Ala Ala Ala Lys Asp Ser Thr Arg Arg Val Leu Val Tyr Cys 
690 695 700 

Leu Asp Arg He Leu Arg Leu Leu His Pro Phe Met Pro Phe He Thr 
705 ~ 710 715 720 

Glu Glu He Trp Gin Lys Leu Pro Met Ala Arg Pro Thr Asp Ser He 
725 730 735 

Met He Ala Pro Tyr Pro Glu Pro Asp Ala Ala His Val Asp Glu Ala 
740 745 750 

Ala Glu Gly Glu Met Ala Pro Val He Ala Ala He Glu Gly Leu Arg 
755 760 765 

Thr He Arg Gly Glu Ser Asn Leu Ser Pro Ala Thr Lys Val Lys Ala 
770 775 780 

Val Val Gin Ser Ala Asp Val Arg Thr Arg Glu Leu Leu Glu Arg Trp 
785 790 795 • 800 

Arg Gly Tyr Leu Met Pro Leu Ala Gly Leu Ser Glu Val Thr Val Gly 
805 810 815 

Ala. Pro Gly Ala Lys Pro Pro Gin Ala Ala Ala Phe Val Gly Ser Asn 
820 825 830 

Leu Glu He Tyr Val Pro Leu Ala Gly Leu Val Asp Leu Asp Ala Glu 
835 840 845 

Arg Glu Arg Leu Lys Lys Glu He Ala Arg Ala Glu Gin Glu Met Ala 
850 855 860 

Gly Val Leu Arg Lys Leu Glu Asn Pro Asn Phe Val Ala Lys Ala Pro 
865 870 875 880 

Pro Asp Val Val Glu Lys Asp Arg Ala Arg Val Glu Glu Leu Lys Glu 
885 890 895 

Arg Lys Ala Lys Leu Gin Asp His Leu Gin Arg He Ala Pro Glu Pro 
900 905 910 

Ala Met Pro Glu Thr Pro Pro Ser Glu Ser Thr Thr Pro Glu Ser Ser 
915 920 925 



Thr Pro Thr Ala Glu Glu Gly Ala Ser Pro Ser Ala Pro Ala Gin Val 
930 935 940 
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Lys Val Ala Pro Asn Gin Glu Glu Lys Gly Gly Val Asp Leu Gly Gin 
945 950 955 960 



Glu Leu Lys Gly Glu Leu Gly Glu Thr Gly Thr Ala Ser Glu Ala Gly 
965 970 975 



Asp Thr Gin Val Gin Asp Ala Leu Ala Lys Leu Arg Glu Gly Thr Lys 
980 985 990 



Glu Gly Leu Ser Pro Ser Asp His His Asp Leu Gly Val Ala Tyr Met 
995 1000 1005 



Thr Met Gly Leu Val Asp Asp Ala Met Arg Glu Phe Asn Arg Ala 
1010 1015 1020 



Lys Glu Gly Gly Asp Thr Arg Glu Thr Pro Ala Glu Gly Ala Glu 
1025 1030 1035 



Ala Glu Glu Gly Glu Ser Gly Pro Val Lys Ala His Gly Asp Met 
1040 1045 1050 



Leu Pro Thr Arg Glu Val Thr Gin Gly Glu Pro Val Arg Ser Ala 
1055 1060 1065 



Ser Lys Lys Lys Ala Thr Ala Pro Gly Lys Val His Gly Asp Lys 
1070 1075 1080 



Leu Ala Pro Ala Pro Thr Thr Glu Glu Ala Glu Ala Pro Ala Val 
1085 1090 1095 



Lys Lys Ala Ala 
1100 



<210> 33 

<211> 21953 

<212> DNA 

<213> Angiococcus disciformis 
<220> 

<221> CDS 

<222> (1) . . (7878) 

<223> tube 
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<220> 

<221> CDS 

<222> (18408) . . (21953) 

<223> tubE 



<400> 33 

ttg age gec gga gcg ttg etc gec cac gec gcg age ctg ggt gta cgc 

Leu Ser Ala Gly Ala Leu Leu Ala His Ala Ala Ser Leu Gly Val Arg 
15 10 15 

ctg tgg gta gag ggc gag cga ctg cgc ttc cag get ccc cca ggg gtg 
Leu Trp Val Glu Gly Glu Arg Leu Arg Phe Gin Ala Pro Pro Gly Val 
20 25 30 



ate gca ctg eta egg cag etc cag ccc tec tct cag gga ggc age etc 
He Ala Leu Leu Arg Gin Leu Gin Pro Ser Ser Gin Gly Gly Ser Leu 
50 55 60 

ctg gec cca gtg get cga aac ggg egg ctg gcg etc tec ttc gee cag 
Leu Ala Pro Val Ala Arg Asn Gly Arg Leu Ala Leu Ser Phe Ala Gin 
65 70 75 80 

cag egg ctg tgg ttc cag gag cag etc cac cct gag gee ccg gee aac 
Gin Arg Leu Trp Phe Gin Glu Gin Leu His Pro Glu Ala Pro Ala Asn 
85 90 95 



ccg tgg cag ccg gcg ctg gag gtg gaa gcg eta ccg ggt gee ace gtg 
Pro Trp Gin Pro Ala Leu Glu Val Glu Ala Leu Pro Gly Ala Thr Val 
145 ~ 150 155 160 



cgc ctg gee gag cag cag cac gtg ctg gtg etc age etc cac cac ate 
Arg Leu Ala Glu Gin Gin His Val Leu Val Leu Ser Leu His His He 



48 



96 



atg acg ccc gag ctg cag tec cgc ctg gga ggc get cgc cat gag etc 144 
Met Thr Pro Glu Leu Gin Ser Arg Leu Gly Gly Ala Arg His Glu Leu 
35 40 45 



192 



240 



288 



aac etc acg ggg gee gtg gtg ttc acc ggc ccg ctg cac gtg gee gcg 336 
Asn Leu Thr Gly Ala Val Val Phe Thr Gly Pro Leu His Val Ala Ala 
100 105 HO 

ctg etc ggg gcg gtg get gcg ctg gtg agg cga cac gaa gcg ctg cgc 3 84 

Leu Leu Gly Ala Val Ala Ala Leu Val Arg Arg His Glu Ala Leu Arg 
115 120 125 

acc acc ctg ggc gag gag ggc gga gtc ccc tat teg etc ate ggc gag 432 
Thr Thr Leu Gly Glu Glu Gly Gly Val Pro Tyr Ser Leu He Gly Glu 
13 0 135 140 



480 



ggc gag egg ctg gag cag gee cga gag gtg gee ctg gee gag tec egg 52 8 

Gly Glu Arg Leu Glu Gin Ala Arg Glu Val Ala Leu Ala Glu Ser Arg 
165 170 175 

cgc cgc ttc gee ctg gag acg gag ccc cac ctg cgc gtg cgc ctg ctg 
Arg Arg Phe Ala Leu Glu Thr Glu Pro His Leu Arg Val Arg Leu Leu 
180 185 190 



576 



624 
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195 200 205 

gcg gcg gat gga gtc ggg ctg cag gtg etc gaa cag gag ctg gcg gcg 672 
Ala Ala Asp Gly Val Gly Leu Gin Val Leu Glu Gin Glu Leu Ala Ala 



210 215 220 

etc tac ggc gcg ctg age get ggg gee gag ccg egg eta cct ccc ctg 

Leu Tyr Gly Ala Leu Ser Ala Gly Ala Glu Pro Arg Leu Pro Pro Leu 

225 " 230 235 240 



caa gee cag gtg ctg cgc gcg ctg ggg caa ggc gag ggc gec acc etc 
Gin Ala Gin Val Leu Arg Ala Leu Gly Gin Gly Glu Gly Ala Thr Leu 
305 310 315 320 

tac atg acg etc ctg gee gee ctg ggc gtg ctg ctg caa egg tgg acg 
Tyr Met Thr Leu Leu Ala Ala Leu Gly Val Leu Leu Gin Arg Trp Thr 
325 330 335 

ggc cag cac gat atg gcg gtg ggc age gee gee gee aac cgc aac cgc 
Gly Gin His Asp Met Ala Val Gly Ser Ala Ala Ala Asn Arg Asn Arg 
340 345 350 



gca cgc agg gtg tgt gtg gag gee tat gee cat cag gag ttg ccc ttc 
Ala Arg Arg Val Cys Val Glu Ala Tyr Ala His Gin Glu Leu Pro Phe 
385 " 390 395 400 



720 



ccc ctg cag gtg gcg gac ctt gee gac tgg cag cgc egg tgg gtg gag 768 
Pro Leu Gin Val Ala Asp Leu Ala Asp Trp Gin Arg Arg Trp Val Glu 
245 250 255 

gga gag gag tac cag gtc cag etc get tac tgg egg cga cag etc gee 816 
Gly Glu Glu Tyr Gin Val Gin Leu Ala Tyr Trp Arg Arg Gin Leu Ala 
260 265 270 

ggg etc acg ccc ctg gag gta cca gga gat cat cct cgc ccg cgt ate 
Gly Leu Thr Pro Leu Glu Val Pro Gly Asp His Pro Arg Pro Arg lie 
275 280 285 

ccc tec atg aga ggc gcg gag gta egg gcg ccc ctg etc tec get ccg 912 
Pro Ser Met Arg Gly Ala Glu Val Arg Ala Pro Leu Leu Ser Ala Pro 
290 295 300 



864 



960 



1008 



1056 



ccg ggc ttg gag ggc ate etc ggc ttc etc etc aac ate gtc ctg ctg 1104 

Pro Gly Leu Glu Gly lie Leu Gly Phe Leu Leu Asn lie Val Leu Leu 
355 360 365 

egg etc gac ttg aga ggc cgc ccc cgc ttc cgc gag ttg ctg cgt cag 1152 

Arg Leu Asp Leu Arg Gly Arg Pro Arg Phe Arg Glu Leu Leu Arg Gin 
370 375 380 



1200 



gag cac ctg gtg gag gcg etc cag cca ggc age gag cgc ggg gac age 1248 

iVlu His Leu Val Glu Ala Leu Gin Pro Gly Ser Glu Arg Gly Asp Ser 
405 410 415 

teg etc tat cgc gtg gcg etc gcg gtg age gac acg cca tgg atg ccc 1296 

Ser Leu Tyr Arg Val Ala Leu Ala Val Ser Asp Thr Pro Trp Met Pro 
420 425 430 

~:jc cac ggg ctg aag ctg gag ggg gtg caa gcg cag ccg etc gac ttt 13 44 

•Jly His Gly Leu Lys Leu Glu Gly Val Gin Ala Gin Pro Leu Asp Phe 
435 440 44 5 

ccc egg ggg gtg ctg gac ctg gat etc cac ctg tgg gtg tat gac acc 13 92 
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Pro Arg Gly Val Leu Asp Leu Asp Leu His Leu Trp Val Tyr Asp Thr 
450 ~ " 455 460 

ggg gag ggg etc acc ggg egg ttg gag tac gec gtg gat ctg tat gag 1440 
Gly Glu Gly Leu Thr Gly Arg Leu Glu Tyr Ala Val Asp Leu Tyr Glu 
465 ~ 470 475 480 



gag ccc acg gca agg egg etc ctg gag ggc ttc cgc cag gtg ctg gag 
Glu Pro Thr Ala Arg Arg Leu Leu Glu Gly Phe Arg Gin Val Leu Glu 
485 490 495 



cgc ccc tat ccg cga gag gec age gtt cac ggc etc ttt cag cag cgt 
Arg Pro Tyr Pro Arg Glu Ala Ser Val His Gly Leu Phe Gin Gin Arg 
530 535 540 

gec etc cag gec ccg egg gec gtg gca gtg gtg tat ggc gag cgc age 
Ala Leu Gin Ala Pro Arg Ala Val Ala Val Val Tyr Gly Glu Arg Ser 
545 550 555 560 



gag cgc teg ccg gag cag gtg gag tec atg ctg gec gta etc caa gee 
Glu Arg Ser Pro Glu Gin Val Glu Ser Met Leu Ala Val Leu Gin Ala 
595 600 605 

ggg gee gee tac gtg ccg ctg gat cct tec tac ccc gta cag cgc cag 
Gly Ala Ala Tyr Val Pro Leu Asp Pro Ser Tyr Pro Val Gin Arg Gin 
610 615 620 

gag ttc atg etc cag gac age ggc gcg egg ttg etc gtg cac age ggc 
Glu Phe Met Leu Gin Asp Ser Gly Ala Arg Leu Leu Val His Ser Gly 
625 630 635 640 

ccg ctg ccc ttc get ccc cag ggc tgc gec acg ctg gat etc cag gca 
Pro Leu Pro Phe Ala Pro Gin Gly Cys Ala Thr Leu Asp Leu Gin Ala 
645 650 655 

tgg cac ccc get ccc tec gac ggt ggc gag ccc ctg ccc cag tgc age 
Trp His Pro Ala Pro Ser Asp Gly Gly Glu Pro Leu Pro Gin Cys Ser 
660 665 670 

gga gaa gat ctg gcg tat gtc ate tat acg tec ggc tec acc ggc cag 
Gly Glu Asp Leu Ala Tyr Val lie Tyr Thr Ser Gly Ser Thr Gly Gin 
675 680 685 



1488 



gcg gtg gtg gag gcg ccg gat egg ccc gtg ccg gag ctg ccg gtg ctg 153 6 

Ala Val Val Glu Ala Pro Asp Arg Pro Val Pro Glu Leu Pro Val Leu 
500 505 510 

gga gag cag gag cgc cac cag gtg etc tec ggg tgg aac cgc aca cag 1584 
Gly Glu Gin Glu Arg His Gin Val Leu Ser Gly Trp Asn Arg Thr Gin 
515 520 525 



1632 



1680 



etc acc tac ggc gag ttg gcg gag egg gec cgc ggg etc gca cag ggt 1728 
Leu Thr Tyr Gly Glu Leu Ala Glu Arg Ala Arg Gly Leu Ala Gin Gly 
565 570 575 

ctg gtg gee cga ggc gtg egg cga gga gac ctg gtc gee ctt egg ctg 1776 
Leu Val Ala Arg Gly Val Arg Arg Gly Asp Leu Val Ala Leu Arg Leu 
580 585 590 



1824 



1872 



1920 



1968 



2016 



2064 



ccc aag ggc gtg gee gtg tgc cac egg gec atg acc cgc ctg gtg tgc 
Pro Lys Gly Val Ala Val Cys His Arg Ala Met Thr Arg Leu Val Cys 
690 695 700 



2112 
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aac acg gac tac gtt cag etc ggg cca gag gat egg gtc gec cag get 2160 

Asn Thr Asp Tyr Val Gin Leu Gly Pro Glu Asp Arg Val Ala Gin Ala 
705 710 715 720 

tec aat gee tec ttt gac gcg gee acc ttc gag gtc tgg ggc gcg ctg 2208 
Ser Asn Ala Ser Phe Asp Ala Ala Thr Phe Glu Val Trp Gly Ala Leu 
725 730 735 

i 

etc aat ggg gca egg ctg gtg ggg etc gee acg gag gag gee att cag 2256 
Leu Asn Gly Ala Arg Leu Val Gly Leu Ala Thr Glu Glu Ala He Gin 
740 745 750 

gcg cgc egg ctg gee gag gtc ctg cgc gag cag egg ate tec gtg etc 23 04 

Ala Arg Arg Leu Ala Glu Val Leu Arg Glu Gin Arg He Ser Val Leu 
755 760 765 

ttc gtc acc acc gee etc ttc aac cac gtg gee egg gag caa ccc cag 2352 
Phe Val Thr Thr Ala Leu Phe Asn His Val Ala Arg Glu Gin Pro Gin 
770 775 780 

gee ttc age acc ctg cgc tac ctg etc ttt ggc ggc gag gca gtg gat 2400 
Ala Phe Ser Thr Leu Arg Tyr Leu Leu Phe Gly Gly Glu Ala Val Asp 
785 790 795 800 

gec tec agt gtg agg egg gtg ctg aaa cag ggc get cca gga cac ctt 2448 
Ala Ser Ser Val Arg Arg Val Leu Lys Gin Gly Ala Pro Gly His Leu 
805 810 815 

eta cac gtg tac ggc ccc acg gag aac acc acc ttc tec acc gee tgg 2496 
Leu His Val Tyr Gly Pro Thr Glu Asn Thr Thr Phe Ser Thr Ala Trp 
820 825 830 

egg gtg gaa cac ctg gee gag cag gcg cat acc gtg ccc atg ggt cat 2544 
Arg Val Glu His Leu Ala Glu Gin Ala His Thr Val Pro Met Gly His 
835 840 845 

ccc ate gec aac teg egg ctg cac gtg ctg gac gaa gcg ctg cag ccg 2592 
Pro lie Ala Asn Ser Arg Leu His Val Leu Asp Glu Ala Leu Gin Pro 
850 855 860 

gtg ccg gtg ggc gee atg ggc gag gta tac ctg gga ggt gat ggg ctg 2640 
Val Pro Val Gly Ala Met Gly Glu Val Tyr Leu Gly Gly Asp Gly Leu 
865 870 875 880 

gcg ctg ggc tac tgg cgc cac ccc gag gee acc gcg gag cgc ttc gtg 2688 
Ala Leu Gly Tyr Trp Arg His Pro Glu Ala Thr Ala Glu Arg Phe Val 
885 890 895 

ccg gat ccg cac ggc ctg gag ccc gga ggc egg etc tac cgc acg gga 2736 
Pro Asp Pro His Gly Leu Glu Pro Gly Gly Arg Leu Tyr Arg Thr Gly 
900 905 910 

gat ctg gcg egg egg cag gcg gat ggg gee gtg gtg ttc get ggc cgc 2784 
Asp Leu Ala Arg Arg Gin Ala Asp Gly Ala Val Val Phe Ala Gly Arg 
915 920 925 

gtg gac agg cag gtc aag ctg cga ggc ttc cgc gtg gag ccg gcg gag 2832 
Val Asp Arg Gin Val Lys Leu Arg Gly Phe Arg Val Glu Pro Ala Glu 
930 935 940 

ate gag teg cac ctg tgc gag cac teg gag gtg tec gcg gcg gtg gtg 2880 
lie Clu Ser His Leu Cys Glu His Ser Glu Val Ser Ala Ala Val Val 
945 950 955 960 
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gag ctg egg ggc gaa ggg gec ctg egg egg ctg gtc gec tat gtc gtc 
Glu Leu Arg Gly Glu Gly Ala Leu Arg Arg Leu Val Ala Tyr Val Val 
965 970 975 

ccc cgc gec ggt ggc cgc cca ggt gee gag gag ctg cgc ace ttc etc 
Pro Arg Ala Gly Gly Arg Pro Gly Ala Glu Glu Leu Arg Thr Phe Leu 
980 985 990 

cgc ace cga ctg ccc gag tac atg ctg ccc gee age ttc tec ctg ctg 
Arg Thr Arg Leu Pro Glu Tyr Met Leu Pro Ala Ser Phe Ser Leu Leu 
995 1000 1005 

gag gca ctg ccc etc ace ccc aac ggc aag gtg gat cgc teg gcg 
Glu Ala Leu Pro Leu Thr Pro Asn Gly Lys Val Asp Arg Ser Ala 
1010 1015 1020 

eta ccc gag age ttc gag gag gee tec cgc gag cag get ccg gtg 
Leu Pro Glu Ser Phe Glu Glu Ala Ser Arg Glu Gin Ala Pro Val 
1025 1030 1035 

gta cct cca cga ggc ccc gtg gag gcg ctg ctg gtg gac ate tgg 
Val Pro Pro Arg Gly Pro Val Glu Ala Leu Leu Val Asp lie Trp 
1040 1045 1050 

agg gag gtg etc ggc ace caa egg gta tec gtg cac gac gac ttc 
Arg Glu Val Leu Gly Thr Gin Arg Val Ser Val His Asp Asp Phe 
1055 1060 1065 

ttc gac ctg ggc gga cac tec ctg ctg gee acg egg gtt gtc tec 
Phe Asp Leu Gly Gly His Ser Leu Leu Ala Thr Arg Val Val Ser 
1070 1075 1080 

cgc ttg cga gag gcg ctg cag gtg gag ttg ccg ctg cgc ace etc 
Arg Leu Arg Glu Ala Leu Gin Val Glu Leu Pro Leu Arg Thr Leu 
1085 1090 1095 



2928 



2976 



3024 



3069 



3114 



3159 



3204 



3249 



3294 



ttc gag gee cct cag etc teg 
Phe Glu Ala Pro Gin Leu Ser 
1100 1105 



gcg ctg gcg gee cag gtg gag gtc 
Ala Leu Ala Ala Gin Val Glu Val 
1110 



3339 



ctg etc ggc cac egg cag etc cgc ccc cct ccg etc gtg ccc get 
Leu Leu Gly His Arg Gin Leu Arg Pro Pro Pro Leu Val Pro Ala 
1115 1120 1125 



3384 



gta cgt cca cca gag ctg ccc 
Val Arg Pro Pro Glu Leu Pro 
1130 1135 



etc tec ttc gec cag cag egg ctg 
Leu Ser Phe Ala Gin Gin Arg Leu 
1140 



3429 



tgg ttc 
Trp Phe 
1145 

ctg gat 
Leu Asp 
1160 

gag cgc 
Glu Arg 
1175 



etc cag cag etc gcg cca caa age acc gec tac cag ate 
Leu Gin Gin Leu Ala Pro Gin Ser Thr Ala Tyr Gin lie 
1150 1155 



gcg tgg cat gtg egg 
Ala Trp His Val Arg 
1165 



ggc egg gtg gac gtg ggc gee ctg 
Gly Arg Val Asp Val Gly Ala Leu 
1170 



gec ctg gag cag etc gtg cgc egg cat gag gcg ctg cgc 
Ala Leu Glu Gin Leu Val Arg Arg His Glu Ala Leu Arg 
1180 1185 



3474 



3519 



3564 



acc act ttc gag cct ggc ggc gac ggc gtg cct egg cag cgc ate 
Thr Thr Phe Glu Pro Gly Gly Asp Gly Val Pro Arg Gin Arg lie 



3609 
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1190 

cac gcg 
His Ala 
1205 



1195 



1200 



ccg get ccg gtc ccg ctg egg cag gtg gac 
Pro Ala Pro Val Pro Leu Arg Gin Val Asp 
1210 1215 



cat ggc ate gec gcg egg gag gag gee ttg egg tgg 
His Gly He Ala Ala Arg Glu Glu Ala Leu Arg Trp 
1220 1225 1230 

cag gcg etc egg cca ctg gag ctg gac aag ggg ccg 
Gin Ala Leu Arg Pro Leu Glu Leu Asp Lys Gly Pro 
1235 1240 1245 



ctg cgc tec 
Leu Arg Ser 



atg cgc gag 
Met Arg Glu 



ctg ctg cgc 
Leu Leu Arg 



gtg tec ctg ctg cgc ttg gaa 
Val Ser Leu Leu Arg Leu Glu 
1250 1255 



gag gca cag tec ctg etc ttc etc 
Glu Ala Gin Ser Leu Leu Phe Leu 
1260 



gag ctg cac cat ate gtg ggc gat ggc tgg teg ctg age gtt tgg 

Glu Leu His His He Val Gly Asp Gly Trp Ser Leu Ser Val Trp 

1265 1270 1275 

age cga gag ctg teg cac ctg tac gag gcg gcg etc cat ggc get 

Ser Arg Glu Leu Ser His Leu Tyr Glu Ala Ala Leu His Gly Ala 

1280 1285 1290 

gag ccg ggg ttg gcg ccg ctg ccc gtg cag tac gcg gac ttc gcg 

Glu Pro Gly Leu Ala Pro Leu Pro Val Gin Tyr Ala Asp Phe Ala 

1295 1300 1305 



ctg tgg 
Leu Trp 
1310 

ctg ace 
Leu Thr 
1325 



cag cgc ggc tgg ttg 
Gin Arg Gly Trp Leu 
1315 



caa ggg ccg gtg ctg 
Gin Gly Pro Val Leu 
1320 



tgg tgg egg gag egg etc gcg cgc ctg gee 
Trp Trp Arg Glu Arg Leu Ala Arg Leu Ala 
1330 1335 



egg gag gag 
Arg Glu Glu 



ccg ctg cgc 
Pro Leu Arg 



ctg ccc gcg gac cat gee cgc 
Leu Pro Ala Asp His Ala Arg 
1340 1345 



ccg gaa gtg cag cgc ttc aac ggc 
Pro Glu Val Gin Arg Phe Asn Gly 
1350 



gcg acg tac cgc ttc acg etc ccg ggc gta egg gtg 
Ala Thr Tyr Arg Phe Thr Leu Pro Gly Val Arg Val 
1355 1360 1365 



cag gcg ctg 
Gin Ala Leu 



egg egg etc ggg cat gag cac 
Arg Arg Leu Gly His Glu His 
1370 1375 



ggg gec aca etc ttc atg gtg ttg 
Gly Ala Thr Leu Phe Met Val Leu 
1380 



etc gee ggc ttc aat gcg ctg ctg gcg cgc tac ace 
Leu Ala Gly Phe Asn Ala Leu Leu Ala Arg Tyr Thr 
1385 1390 1395 

gac ate gee att ggc gcg ccc ate gee aac cgc aca 
Asp He Ala He Gly Ala Pro He Ala Asn Arg Thr 
1400 1405 1410 



ggg cag acg 
Gly Gin Thr 



cgt ggg gag 
Arg Gly Glu 



gtg gag ggc etc ate ggc ttc ttc gtc aac ace ctg gtg ctg cgc 
Val Glu Gly Leu He Gly Phe Phe Val Asn Thr Leu Val Leu Arg 
1415 1420 1425 



3654 



3699 



3744 



3789 



3834 



3879 



3924 



3969 



4014 



4059 



4104 



4149 



4194 



4239 



4284 



ace cgc ctg gaa ggc aac ccc tec ttc etc gag ctg ttg agg egg 



4329 
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gtg agg gag aca acg eta gag gec tac gec cac cag 
Val Arg Glu Thr Thr Leu Glu Ala Tyr Ala His Gin 
1445 1450 1455 

ttc gag agg etc gtc gag gag ttg cag ccc gag cgc 
Phe Glu Arg Leu Val Glu Glu Leu Gin Pro Glu Arg 
1460 1465 1470 

cag aac ccg ctg gtg cag gtg ctg etc gec ctg cag 
Gin Asn Pro Leu Val Gin Val Leu Leu Ala Leu Gin 
1475 1480 1485 



gag ctg cct 
Glu Leu Pro 



cag gee aac 
Gin Ala Asn 



aac gcg ccc 
Asn Ala Pro 



4374 



4419 



4464 



cgc gag ccg ttg egg ctg gcg 
Arg Glu Pro Leu Arg Leu Ala 
1490 1495 



ggc ctg gag gee gag cac ctg gag 
Gly Leu Glu Ala Glu His Leu Glu 
1500 



4509 



tac ctg gtg gee ace acg cgc ttc gac etc gag ctg 
Tyr Leu Val Ala Thr Thr Arg Phe Asp Leu Glu Leu 
1505 1510 1515 



cac ctg tgg 
His Leu Trp 



4554 



gag gag gaa gag ggg ctg age tgc ate gec gtg tat gac cgc gcg 
Glu Glu Glu Glu Gly Leu Ser Cys He Ala Val Tyr Asp Arg Ala 
1520 1525 1530 



4599 



ctg tac ggc gcg ggg acg gtg gag egg ctg gtg ggc 
Leu Tyr Gly Ala Gly Thr Val Glu Arg Leu Val Gly 
1535 1540 1545 



gec tgg tgc 
Ala Trp Cys 



4644 



acg ctg ctg gag ggc gtg gee gag etc ccg gcg cgc cgc gtg gcg 
Thr Leu Leu Glu Gly Val Ala Glu Leu Pro Ala Arg Arg Val Ala 
1550 1555 1560 



4689 



gag ctg ccc ctg gtg cct gee egg gag ctg cga cag gtt ccg ccg 
Glu Leu Pro Leu Val Pro Ala Arg Glu Leu Arg Gin Val Pro Pro 
1565 1570 1575 



4734 



ccc tec ccc gee gcg gcg ate gaa aca age ate ggc gee cgc ttc 
Pro Ser Pro Ala Ala Ala He Glu Thr Ser He Gly Ala Arg Phe 
1580 1585 1590 



4779 



teg gag gtg gcg egg egg cag cca ggc gee acg gee gtc ace caa 
Ser Glu Val Ala Arg Arg Gin Pro Gly Ala Thr Ala Val Thr Gin 
1595 1600 1605 



4824 



ggc ggg egg cac etc ace tat gca gag ctg gag gag cgc teg gaa 
Gly Gly Arg His Leu Thr Tyr Ala Glu Leu Glu Glu Arg Ser Glu 
1610 1615 1620 



4869 



cgc ctg gee cgc tac ctg gee tgg ctg ggc gtg cgc 
Arg Leu Ala Arg Tyr Leu Ala Trp Leu Gly Val Arg 
1625 1630 1635 



gcg ggc gac 
Ala Gly Asp 



4914 



aga gtg ggg ttg gee ace gag cgc acg ctg gag egg 
Arg Val Gly Leu Ala Thr Glu Arg Thr Leu Glu Arg 
1640 1645 1650 



ate ate age 
He He Ser 



4959 



ctg eta ggc ate etc aag gcg gga gee gee tac gta ccg ctg gat 
Leu Leu Gly He Leu Lys Ala Gly Ala Ala Tyr Val Pro Leu Asp 
1655 1660 1665 



5004 
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gtg cgc cag cca gcg cgt egg 
Val Arg Gin Pro Ala Arg Arg 
1670 1675 
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Leu Ser Leu Leu Val 
1680 
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ggc gtg cgc acc gtc ate gec gag gag cag get cgc 
Gly Val Arg Thr Val lie Ala Glu Glu Gin Ala Arg 
1685 1690 1695 

tea ggc ctg ggg cag cca ctg acc ctg gtc gat gee 
Ser Gly Leu Gly Gin Pro Leu Thr Leu Val Asp Ala 
1700 1705 1710 



cag gec gcg 
Gin Ala Ala 



acg gtg ctg 
Thr Val Leu 



gec cag gag 
Ala Gin Glu 



5049 



5094 



5139 



ccg gee tec gcg cag cag gtc ccc gcg etc ggg ccg 
Pro Ala Ser Ala Gin Gin Val Pro Ala Leu Gly Pro 
1715 1720 1725 



gag cgc teg 
Glu Arg Ser 



5184 



ctg ggc ggg gac atg etc gec 
Leu Gly Gly Asp Met Leu Ala 
1730 1735 



tac gtg ctt ttc acc 
Tyr Val Leu Phe Thr 
1740 



teg ggc tec 
Ser Gly Ser 



5229 



acc ggc gag ccc aag ggc gtc tgc att ccc cac cgt 
Thr Gly Glu Pro Lys Gly Val Cys He Pro His Arg 
1745 1750 1755 



gec gtg ctg 
Ala Val Leu 



5274 



egg ctg ate cat gag ccc tec tac gtc cag etc teg 
Arg Leu He His Glu Pro Ser Tyr Val Gin Leu Ser 
1760 1765 1770 



ccg egg gag 
Pro Arg Glu 



5319 



gtc atg ctg cac tac gca ccg ctg gag ttc gat gee 
Val Met Leu His Tyr Ala Pro Leu Glu Phe Asp Ala 
1775 1780 1785 



tec acc ttc 
Ser Thr Phe 



5364 



gag gtc tgg ggg gcg ctg etc 
Glu Val Trp Gly Ala Leu Leu 
1790 1795 



aac ggg gca agg etc 
Asn Gly Ala Arg Leu 
1800 



gtg ctg gtc 
Val Leu Val 



5409 



cct ccg gaa cag cag teg etc gag age ctg gga cag 
Pro Pro Glu Gin Gin Ser Leu Glu Ser Leu Gly Gin 
1805 1810 1815 



gag ctg age 
Glu Leu Ser 



5454 



acc cag ggc gtc acc gtg ctg tgg etc acc gee ggg etc ttc egg 
Thr Gin Gly Val Thr Val Leu Trp Leu Thr Ala Gly Leu Phe Arg 
1820 1825 1830 



5499 



etc atg gtg gag gag cag etc aag age ctg cgc ggc gtg cgc cag 
Leu Met Val Glu Glu Gin Leu Lys Ser Leu Arg Gly Val Arg Gin 
1835 1840 1845 



5544 



ctg etc gee gga ggg gac gtg ctg ccc atg ccg cag gta cgc egg 
Leu Leu Ala Gly Gly Asp Val Leu Pro Met Pro Gin Val Arg Arg 
1850 1855 i860 



5589 



eta cgc gag gee etc ccc gag tgc cag etc ate aac 
Leu Arg Glu Ala Leu Pro Glu Cys Gin Leu He Asn 
1865 1870 1875 



ggc tac ggc 
Gly Tyr Gly 



5634 



r-rr* arc; gag age tgc acc ttc acc tgc tgt cac cgc gtg ggc age 5679 

^ro Thr Glu Ser Cys Thr Phe Thr Cys Cys His Arg Val Gly Ser 
1=50 1885 1890 

<:<:c: t':ng gag ctg ggc ggc tec gtg ccc ate ggc acc ccc ate gac 5724 

nv .;: U i Glu Leu Gly Gly Ser Val Pro He Gly Thr Pro He Asp 
1895 1900 1905 
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ctg ggc tgg gtg age gtg gtg gac gag egg etc cag ccc gtg ccg 
Leu Gly Trp Val Ser Val Val Asp Glu Arg Leu Gin Pro Val Pro 
1910 ^ 1915 1920 



gac gga gee ccg ggc gag ctg 
Asp Gly Ala Pro Gly Glu Leu 
1925 1930 



etc gtg ggc ggc ccc 
Leu Val Gly Gly Pro 
1935 



ggg ctg get 
Gly Leu Ala 



tgg ggg tac ctg caa cac ccg gag etc ace gec gag cgc ttc ate 
Trp Gly Tyr Leu Gin His Pro Glu Leu Thr Ala Glu Arg Phe lie 
1940 1945 1950 

ccc gat ccg etc age egg acg ccg ggg get cgc gtc tat egg acg 
Pro Asp Pro Leu Ser Arg Thr Pro Gly Ala Arg Val Tyr Arg Thr 
1955 1960 1965 



gga gac ctg gtg egg cgc agg 
Gly Asp Leu Val Arg Arg Arg 
1970 1975 



gag gat ggg acg ctg 
Glu Asp Gly Thr Leu 
1980 



gag ttc ctg 
Glu Phe Leu 



ggc cgc gtg gac cac cag etc aag gta cgc ggc ttc cgc ate gag 
Gly Arg Val Asp His Gin Leu Lys Val Arg Gly Phe Arg He Glu 
1985 1990 1995 

ccg ggc gag gtg gag gee gcg gtg etc ace cac ccg gcg gtg cag 
Pro Gly Glu Val Glu Ala Ala Val Leu Thr His Pro Ala Val Gin 
2000 2005 2010 

tec gcg gtg gtg gtg ggg cgc gag ggc ccc ggc ggc aag gag ctg 
Ser Ala Val Val Val Gly Arg Glu Gly Pro Gly Gly Lys Glu Leu 
2015 2020 2025 

gtc tgc tac gee gtg ccg egg gtg gag age teg gag cag ggc teg 
Val Cys Tyr Ala Val Pro Arg Val Glu Ser Ser Glu Gin Gly Ser 
2030 2035 2040 

cag cag gag cag egg ctg gta cac gag tgg gag tec gtg ttc gac 
Gin Gin Glu Gin Arg Leu Val His Glu Trp Glu Ser Val Phe Asp 
2045 2050 2055 



ggg cac atg tac cgc gaa gec ccc gtg ggc gga gag 
Gly His Met Tyr Arg Glu Ala Pro Val Gly Gly Glu 
2060 2065 2070 



ccg ace ttc 
Pro Thr Phe 



5769 



5814 



5859 



5904 



5949 



5994 



6039 



6084 



6129 



6174 



6219 



aac ate gtc ggc tgg aag age age tac ace ggc caa ccc gtg gee 
Asn He Val Gly Trp Lys Ser Ser Tyr Thr Gly Gin Pro Val Ala 
2075 2080 2085 



6264 



gtg gag gag atg agg gac tgg ctg cgc cac egg gtg 
Val Glu Glu Met Arg Asp Trp Leu Arg His Arg Val 
2090 2095 2100 

cgc ggg etc agg cca cga egg ate etc gaa gtg ggc 
Arg Gly Leu Arg Pro Arg Arg lie Leu Glu Val Gly 
2105 2110 2115 



gag cgc gtg 
Glu Arg Val 



tgt ggt acg 
Cys Gly Thr 



ggg ctg atg etc ttt gee ctg ctg ccg cac tgc gag cgc tac gtg 
Gly Leu Met Leu Phe Ala Leu Leu Pro His Cys Glu Arg Tyr Val 
2120 2125 2130 



6309 



6354 



6399 



ggc acg gat ttc tec ccg gcg gcg ttg gac tac gtg cgt cgc tac 
Gly Thr Asp Phe Ser Pro Ala Ala Leu Asp Tyr Val Arg Arg Tyr 



6444 



WO 2004/022586 
2135 



etc ccc ccg gag cac ccg ggc 
Leu Pro Pro Glu His Pro Gly 
2150 2155 



238 



2140 
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2145 



cgc gtg gag ctg eta 
Arg Val Glu Leu Leu 
2160 



cac cgc acg 
His Arg Thr 



6489 



gcg gac gag tgg age ggc gtg gcg gec ggc tec ttc 
Ala Asp Glu Trp Ser Gly Val Ala Ala Gly Ser Phe 
2165 2170 2175 



gac gcg gtg 
Asp Ala Val 



ctg etc aat teg gtg gtg cag tac ttc ccc teg cag gag tac ctg 
Leu Leu Asn Ser Val Val Gin Tyr Phe Pro Ser Gin Glu Tyr Leu 
2180 2185 2190 



6534 



6579 



egg cag gtg ctg gee cgc tgc gtg gag gee gtg gag gac ggg ggc 6 624 

Arg Gin Val Leu Ala Arg Cys Val Glu Ala Val Glu Asp Gly Gly 
2195 2200 2205 

ttc gtc ttc gtg ggg gat gtg egg age ttg ccg ctg ctg gag tec 6669 
Phe Val Phe Val Gly Asp Val Arg Ser Leu Pro Leu Leu Glu Ser 
2210 2215 2220 



ttc cac gee teg gtg gag ctg 

Phe His Ala Ser Val Glu Leu 
2225 2230 

ctg gag gcg tgg cgc gag egg 

Leu Glu Ala Trp Arg Glu Arg 
2240 2245 



gag egg gee gee ccc 
Glu Arg Ala Ala Pro 
2235 



tec atg ccg 
Ser Met Pro 



gtg egg cgt gcg gtg ctg gag gac 
Val Arg Arg Ala Val Leu Glu Asp 
2250 



aac gag ctg gtg gtg gac ccg gca etc ttc gtg gcg ctg gec cat 
Asn Glu Leu Val Val Asp Pro Ala Leu Phe Val Ala Leu Ala His 
2255 2260 2265 



6714 



6759 



6804 



cag cat cca egg gtg age cac gtg gat ate gag ctg acg cgc ggc 
Gin His Pro Arg Val Ser His Val Asp lie Glu Leu Thr Arg Gly 
2270 2275 2280 



6849 



acg cac ccg aat gag atg gca cgc ttc cgc tac aac gee gtg etc 
Thr His Pro Asn Glu Met Ala Arg Phe Arg Tyr Asn Ala Val Leu 
2285 2290 2295 



6894 



cac ate ggg ccg cgc acg ccg cca ccg gee tec gag gtg ccg tgg 
His lie Gly Pro Arg Thr Pro Pro Pro Ala Ser Glu Val Pro Trp 
2300 2305 2310 

gtg gac tgg tec acg caa gga ctg age etc gat gca ctg egg get 
Val Asp Trp Ser Thr Gin Gly Leu Ser Leu Asp Ala Leu Arg Ala 
2315 2320 2325 



6939 



6984 



egg c^c egg cag ggc ccg cca ggg cca ctg ggg gtg gca ggc att 7 029 

Arg Leu Arg Gin Gly Pro Pro Gly Pro Leu Gly Val Ala Gly lie 
2330 2335 2340 

ccc aat gcg cga gtg ctg ccc gcg gtc egg gec gec gag gcg ctg 7074 

Pro Asn Ala Arg Val Leu Pro Ala Val Arg Ala Ala Glu Ala Leu 
2345 2350 2355 



ggc age 



aca ggc age gcg egg 
Thr Gly Ser Ala Arg 
2360 2365 



egg gtg gag gag ctg cgc egg cgc 
Arg Val Glu Glu Leu Arg Arg Arg 
2370 



7119 



ctg tec caa ccc ggc aca ggg gec cag gat cca gac ccc ttc tgg 



7164 
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Leu Ser Gin Pro Gly Thr Gly Ala Gin Asp Pro Asp Pro Phe Trp 
2375 2380 2385 

caa ctg gcc gag teg etc ggc tac acg gcg gca gtg age tgg tct 
Gin Leu Ala Glu Ser Leu Gly Tyr Thr Ala Ala Val Ser Trp Ser 
2390 2395 2400 

cct ggg egg agg gac ggg gcc ttc gac gtg etc ttc etc ccg gcg 
Pro Gly Arg Arg Asp Gly Ala Phe Asp Val Leu Phe Leu Pro Ala 
2405 2410 2415 

acg cca ggg atg cac ccg egg tgg etc ggc ccc acg ccg etc aac 
Thr Pro Gly Met His Pro Arg Trp Leu Gly Pro Thr Pro Leu Asn 
2420 2425 2430 

cgc acg ccg ccc cca gcc tec gcg cgt ctg tec teg gag cct egg 
Arg Thr Pro Pro Pro Ala Ser Ala Arg Leu Ser Ser Glu Pro Arg 
2435 2440 2445 

egg gcc age etc tea ctg agg ctg ggc tec gcg ctg egg gcc cac 
Arg Ala Ser Leu Ser Leu Arg Leu Gly Ser Ala Leu Arg Ala His 
2450 2455 2460 

ctg cag ace cat eta cca gac ttc atg gtg ccc tea cgc ttc gtg 
Leu Gin Thr His Leu Pro Asp Phe Met Val Pro Ser Arg Phe Val 
2465 2470 2475 

gtg ctg cag tec ctg ccg etc act ccc aat ggc aag gtg gac cgc 
Val Leu Gin Ser Leu Pro Leu Thr Pro Asn Gly Lys Val Asp Arg 
2480 2485 2490 

gcg gcc ttg ccc gtc ccc gac teg cgt egg ctg gaa tct gcc ccg 
Ala Ala Leu Pro Val Pro Asp Ser Arg Arg Leu Glu Ser Ala Pro 
2495 2500 2505 

ctg gtg ccc ccc age aac gag ctg gag egg gtg ctg gcg cag gtg 
Leu Val Pro Pro Ser Asn Glu Leu Glu Arg Val Leu Ala Gin Val 
2510 2515 2520 



7209 



7254 



7299 



7344 



7389 



7434 



7479 



7524 



7569 



tgg aag gaa gtg ctg gga ctg gag gaa gtg age egg gag gac aac 
Trp Lys Glu Val Leu Gly Leu Glu Glu Val Ser Arg Glu Asp Asn 
2525 2530 2535 



7614 



ttc ttc gac gtg gga ggg cac teg ctg ctg ctg gcg cag gtg tgc 
Phe Phe Asp Val Gly Gly His Ser Leu Leu Leu Ala Gin Val Cys 
2540 2545 2550 



7659 



age egg ctg gag gcc cga ctg ggc egg egg etc gaa ctg gtg ace 
Ser Arg Leu Glu Ala Arg Leu Gly Arg Arg Leu Glu Leu Val Thr 
2555 2560 2565 



7704 



etc ttc cgc tac tec tec att get gcc ctg gcc gag cac ctg cag 
Leu Phe Arg Tyr Ser Ser lie Ala Ala Leu Ala Glu His Leu Gin 
2570 2575 2580 



7749 



gcc ccc cag gag ctg gca gcc gcc gag gcg cag gtg cag cgc atg 
Ala Pro Gin Glu Leu Ala Ala Ala Glu Ala Gin Val Gin Arg Met 
2585 2590 2595 



7794 



gcc acg gag cgc gcc ttg etc cag caa cag gca gcc caa cgc cgc 
Ala Thr Glu Arg Ala Leu Leu Gin Gin Gin Ala Ala Gin Arg Arg 
2600 2605 2610 



7839 
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egg gec ggt teg aag egg gga get ccc aat gac ace tag tggggatgaa 
Arg Ala Gly Ser Lys Arg Gly Ala Pro Asn Asp Thr 



gcgctgcagt cttccatcgc cctggtaggc atggccggcc gcttcccggg ggccccggac 
gtggagtcct tctggcgcaa tctggtggcc ggagtggagt ccatttcctt cttctccgag 
gaagagctga gaeaggeggg cgtgtccgag cagataegtc ggcgccccga gtacgtgccc 
gecaagggtg tgctggaaga cctggagctg ttcgacgegg gettcttegg ctactcgccg 
cgegaggega gccacctgga tccgcagcaa cgcctgctgc tggagtgctc gtgggaggcg 
ctggaggatg cgggccttcg cccggatcag ctcccaggct gggtgggcgt ctacgtgggc 
geeggggaca ccagctaccg cttccagctg ctgcgcggac acggggaccc gctcagcggc 
tccaaggatg tggegggett ctttggcaac tacccggact ttctcgccac gcgcgtggcc 
tacaagctca atetgegegg ccccgcgctc ggcatccaca ccgcgtgctc cacctcgctc 
gtctccatca acatggcgtg cagcgcgctg cgcggcttcg agtgtgacat ggcgctcgcg 
ggeggegtet cgctgcggct gcccgcgcgc tccggctacc tctacgagga ggggggagtc 
gectcgaagg aegggcactg ccgccccttc gaegegegtg ctaccggcac cgtcacgggg 
gatggcgtgg gcgtggtggt gctcaagcgc ctggaggacg egctgaagge cagggatccc 
atccacgcgg tgatccgegg ctgggcgctc aacaacgacg gcgcctcccg ggctggcttc 
accgcgccca gcgtggaggg ccaatccgag gtgatcgccc tggcgcacgc cgcggcgggc 
atcagcgcac gcgacatcac ctacgtggag gctcacggca ccggcactcc cctgggagac 
ccgatcgaag tggeggeget cacccgcgcc ttccgggcgc atacegctga caccgccttc 
tgtaccctcg gtgcggtcaa atccaacatc ggacacctgg atgeggegge aggagtggcc 
ggggtgatca agaeggtgea ggccctgcgc caccggctca tccctcccac gctccacttc 
gagcggccca atcccgcgct ccacctggag cagagcccct tettegtcaa cacccagccg 
ctcccgtggg agagcccgcg cgggcctcgg ctggcggggg tgagctcctt tggcattggc 
ggcaccaacg cccacacgct cttcgaagag gctccgccgc cgcccgccag cggccccacc 
cgtcccaacc aggtgetget gctctcggcc cgaagcacca gcgcgctcga gcacatcgca 
ggaeggcteg cggcacacct gcgccgccac ccggacctgg aactggcgga cgtggccttt 
accctccagg tggggegage ccgcttcccc taccggcgcg ccctcacctg tcgcaccctg 
gccgaggcca tggagegect ggaggctccc gagccgcggc cacccgagcc gcttgctcac 
gagggagagc ggccccctct cgtcatgctc tttccagggc agggcactcc getggtegge 
acggcccgtg cgctccatga atccgagccc accttccgcc aggcggtgga gcagtgtgcc 
cggctcctga ggcagacget ggggctggat gtcagggagg tgctcttccc ctccgcggag 
caggaggagc aggcacgccg gctggcggcg cagacgeggg tggcccagcc cgcgctcttc 
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accctggagt 


atgccctggc 


gcagacctgg 


ctgggctggg 


ggctccaacc 


gcaggcgctg 


gcaggccata 


gcctgggcga 


gctggtggcg 


gcgtgcctcg 


cgggtgtctt 


ctccctggaa 


gatgcacttc 


agctcgtggc 


agcccgggga 


cagctcatgc 


agggatgccc 


accgggggcc 


atgctggccg 


tgccgctgcc 


ggaggccgag 


ctggcggcgc 


tgctgggcag 


cgagctgtgt 


atcgccgcgg 


tgaacgggcc 


acgggcctgc 


gtggcctcgg 


gtcctctacc 


ggcggtagag 


gcgctcaccg 


ccgcgctgga 


gagccggggc 


gtgtccagcc 


ggcgcctgga 


gacttcccat 


gccttccact 


ctgcctccat 


ggaggcatgc 


caggggccac 


tcaccacgct 


gctgcggcgc 


atgcgcctgc 


aggcccctcg 


cctcccctgt 


gtctccggcc 


tcaccggccg 


gtggctcacc 


ggcgaggagg 


ccacggagcc 


tacgtactgg 


gcccgccagc 


tccgcgagcc 


ggtacgtttc 


tccgaggccc 


tcgagacgct 


ctggagcctc 


aaggagccag 


tgctgctgga 


ggtaggcccc 


ggcaccacgc 


tcaccgcgct 


tgcgcggcgg 


caccccaccc 


gccccgcccg 


cacccaggag 


gtggcgagcc 


ttcccgtgca 


accggacacg 


gccgtgccgt 


gcattgaaga 


ggccgtgggt 


gagctgtggc 


aggccggcct 


ggagctggac 


tggagcgccc 


tgcacgccgc 


gccacgccac 


cgcgcacacc 


ttccacccta 


ccccttcgag 


cgccagcgct 


actggattga 


accggaggct 


gccccgcagc 


cgcgcgcgca 


gcagccgacg 


cccgcgtccc 


ttgtgccccc 


ggagcagccc 


tcgcgcgagg 


ccctggaaga 


ctggttctac 


gtgcccacct 


gggagcaggc 


acctgccacg 


agcggggggg 


gacagcccct 


ggcgggcccg 


gtgctcgcct 


tcatggactc 


ctcgggcctg 


gccgagcagg 


tgctggccgc 


gctgtggccc 


gcggactccg 


gcgcgctcct 


cacccgcgtc 


gagccggccg 


ggcactatga 


gcaactcagt 


gagcacgcct 


tccgcctccg 


ccccgagagc 


gaggaggact 


gggacgcgct 


cttccaggcg 


ctccagtccc 


agggccggct 


cccccgtcgc 


atcctccatg 


cctgggctct 


caccgcggag 


cccggcccct 


gtaccccgga 


c ggggaagcg 


gtgctggaac 


agggcttctt 


cagcctgctg 


cggctggccc 


gtgcgctcgg 


ccgccacgcg 


cctgagcgcc 


ctgtacagct 


cgaggtgctg 


tccagcttcg 


tccacgcggt 


gggcccgcgg 


gagccgctcg 


agccgctcaa 


ggccacgctg 


ctgggggcct 


gtgcggtact 


tcccctggag 


taccctcacg 


tgcagtgccg 


caccattgac 


gtgcggcccg 


ggagcgagcc 


acgggaagtg 


ctggtgagga 


gcctggccgc 


cgagctggcc 


gctcccatgg 


gagagagccc 


ggtggcctgg 


cgcgacgggc 


agcggtacgt 


gcgccgcgcc 


acccggcaga 


ggctcgaggc 


ctcgcggccc 


ctgcgcagcc 


ttcgggagcg 


aggcgtgtac 


ctggtagcag 


gagggctggg 


tggaatcggc 


ctggtgctgg 


cgcgagcgct 


ggcccagcgg 


gcgcgtgctc 


ggctggcgct 


actcacccac 


tcgcccttcc 


ctccccgaga 


gcagtgggag 


cagtggctgg 


aggaagcccc 


ggcgcacccg 


gagccggcgt 


ggcggagcga 


ggccgacccc 


tcggagcgcc 


gcaggaccca 


gcaccgcatc 
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cgctgcctgc 


tggagctgga 


gcagctcggc 


gccgaggtgc 


aggtgtacac 


ggcggacgtg 


gcggaggagg 


ccgccgtgcg 


ctcggtggtg 


gagcaggtgc 


acgcccgctg 


ggggaagatc 


cacggcgtgc 


tgcacgccgc 


cgccacgttc 


gacgacggcg 


tcatccagct 


ccgcacgcac 


gagcagtcct 


cgcgggccct 


gcgcaccaag 


gtgcggggca 


gcatggtgct 


gcacgaggtg 


ctggcgagcg 


aggggttgga 


ttggttcgcc 


ctgtgctcct 


cgctggcgtc 


ggcgctgggc 


tcattcggcc 


aggcggacta 


ctgcgcggcc 


aatgccttcc 


aggatgcgta 


tgcgcaccac 


ctgcgccggc 


agggcttcac 


gggagcgctg 


gcgctggact 


ggggcacgtg 


gagagatacg 


ggggcggcca 


tgcggctggt 


ggcacgcacc 


cgacgggggg 


gccatgagaa 


gccgcccacg 


ccgctcaccc 


acccactctt 


cgattgcgaa 


cagcgcgagc 


cgggcgggac 


gcactggctc 


ggcctcacgc 


tgaggggtgg 


agaggactgg 


gtggtggacg 


agcaccggct 


gcaaggggtg 


ccaacactgc 


ccggggtggc 


ctacctggag 


ctggcgcggg 


cggcgtgtgc 


ccaggcgctg 


ggggccgagg 


cggtggagct 


ggcggagctg 


ttgctgctgg 


agcctctgac 


ggtacctcga 


ggcgaatcca 


ggcaggtccg 


ggtggtgctc 


cagcccgagg 


ggcaggctca 


tgccctgcgg 


gtggagagcc 


ggtcggagga 


agcgcgggga 


tggaatgagc 


atgcgcgggg 


ccgggtgcgc 


gccgtgcctc 


ggctggccga 


gcgcatccag 


cccgagctgc 


tgcgcgccgc 


ctgcgaacac 


gagcagcccg 


tgcccggaga 


gccccaggag 


caaggccctg 


tccacgccgg 


agcacgctgg 


catggcctct 


tccagtgggt 


tcgccgcggc 


cctcgccagg 


ccctcgccca 


gctcgcgcta 


ccggagccct 


tccacggtga 


cctcgagcgc 


ttcgagctcc 


acccggcgct 


catggacatg 


gccaccagct 


tcgccattcc 


cggcggagtg 


ccctggctcg 


cctttggcta 


tgagcgcgtg 


ctcattcacg 


gtccgctgcc 


accgcaggtg 


ctcagccacg 


tgagcctgcc 


cgaggagtcg 


caggctggcg 


cgcaacaact 


ccggctacag 


gtgcggctgc 


tggatctgga 


gggctgggag 


cgggtgcgga 


tcgatgggta 


cctgctgcgg 


ccgctgaagc 


ccagcgacgc 


cagtgtcgag 


ccggcggcgc 


ccaacgtgga 


agtcgccgtg 


gggacgccgg 


ggctgctgga 


gagccttggc 


ctgcgccgct 


gcacccgccc 


cgcccccggg 


ccgcgccagg 


tggagatcga 


ggtggaggcc 


gctgggctca 


acttcctgga 


cgtgctgggc 


gcgctgggga 


tgatgccggc 


attggaggcg 


gaggagagcg 


tactggggcg 


cgagtgctcc 


ggacgcattg 


ccgccgtggg 


cgagggcgtc 


agcgggctgc 


gcgtggggga 


cgaggtgctg 


gcggtggccc 


caggctgctt 


ccgctcctac 


gtgctggtgg 


atgagagcca 


ggtggtgcgc 


aggcccgcct 


cgctggggct 


cgccgagggg 


acaacccaga 


tggtgccgtt 


cgccacggcg 


tacttcgccc 


tgcacaccgt 


gggccggttg 




agcgcatcct 


catccacgcc 


gcggccggag 


ggctgggtct 


ggccgccgtt 




cccggaccgg 


ggcggagata 


ttggccaccg 


cgggcagtga 


gcagaagcgc 


gagtacctgc 


gctcgctcgg 


cattgctcac 


gtgctggact 


cgcgcagcac 


ctccttcgtc 
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agcgaagtgc gcgagcgcac cggcgggcgt ggggtggatg tggtgctcaa ctcgctggcg 13528 

ggagagctgc tcctggcggg cctgtccgtg ctggccccgc acggccgctt cctggagctg 13588 

gggaagaggg acctgtatgc ggaccagcag gtgggcctcc gaaccctggc ccgagggcag 13648 

actttcgccg ccattgactt cggtccccac cacccggact tccgagcggt gctcgaggag 13708 

gtggccacgc aactcaccca gggccagctc gagccattgc ccacccgcct cttccccgcg 13768 

cggcaggtgg ccgaagcctt cagcttcatg gcccgcgcgc tgcacatcgg tcgcgtcgcc 13828 

gtctccatgc agggggcgac ggcattgccc gcgtccatga ctcggggctc caggcccgca 13888 

ccggtggccg tacctccttg ggaggacccg cggctggcgg gcggcatctc ctctgaagag 13 948 

ggtgcggagg ccttcctgcg cgcgttggag caaggggcac cgcagctcat catctccccc 14008 

caggacttca gctcgctgct gcgcggcctg ggcggcagcc agggcgtgcg cgaaaaggag 14068 

cgcctcgtca ccgggcgcgc cgccgccgcc gagccgcagg ccttgccacc ctcctcgctg 14128 

gagcagctca tcgagcaggt gtggcgcaag cacctgggtg tggagcgcgt gcagcccacg 14188 

gacagcttct tccagcttgg aggagactcg ctgctgggca tccaagtggc ggcggatctc 14248 

cgcaggcacc tgggtgtgga gctgcccacg gccaccctct tcagccaccc caccctcgcc 14308 

gcgctggccg cggctctgcg ggctcgacag ggtgaggccg cggctcccac tgcccccgcg 14368 

ccagcgctcg tgccggatcc agccgcgcgc ttcgagcctt tccccctcac ggatgtgcag 14428 

gaagcctact gggtgggtcg ccgctcggcc ttcgagctcg gcggcgtcgc cgcccatggc 14488 

tacttcgaaa tcgaaagccc ggggctggag gtggagcgct tcatccaatg ctggcgccag 14548 

ctcctgcagc gccacgacat gctgcgcatg gtggtgctcc cggatgggcg gcagcaggtg 14608 

ctcgagcagg tgccggagta cacgccggag gtggtggagc tgcgggggct ctccccccag 14668 

gaggccgagt cccgccggct ccagctgcgt gagcgcatgg cccaccaggt cctgcgcagc 14728 

gaccgctggc cgctcttcga gctggtgctc tgccggtacg agggaggcgt ccgcatccac 14788 

atgagcatgg atgccctgat gctggatgcg tggagttcag cggtgcttcg gcaggacttc 14848 

gcccagctgt accacgagcc gggccggccg ctggagccgc tggccatcac cttccgcgac 14908 

tacgtgctgg cggagcgccg gctgcgcgag ggcgaggccc atgagcgcgc ccgcgcatac 14968 

tggtgggctc ggctggacac gctgccgcca ccgcccgagc tgcccctggt gaaggaaccc 1502 8 

tcgcagctgg agcacgcgcg gttcacccac cgcgaggctc ggctcgagcc acaccgctgg 15088 

gcccggctcc aggagcgggc gcgcgcccac ggcctcaccc cctcggccgc ctgcatggcc 15148 

gccttcgccg aggtgcttgc ccgctggagc cgtcacccgc gcttcaccct caacctcacc 15208 

ctcttccagc gcttgcccct gcacccgcag gtggacgagc tggtgggcga cttcacctcc 15268 

ctggtcctgc tggaggtaga ggcacacgcg gcgagcacct tcgccgagcg tgcctcccgg 15328 
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ctccaggcac agctatggcg ggacctggag cacggcagcg tgagcgccgt gcagctcatc 15388 
cgcgagctcg tccgcaccgg ccgccgctcc ccgggcgcca tcatgcccgt cgtcttcacc 15448 
agcatcctca gcctggatgc gcggcgcggc ccccagggca gcctctcctt cttcgaggga 15508 
gaactggtgt acagcatcag ccagactccc caggtgtggc tggaccacgg agtccacgag 15568 
gaggaggggg cgctcgtcct ggcgtgggac tcggtggagg cgctcttccc tccgggcatg 15628 
gtggacgaca tgttccacgc ctaccagcgg ctgctggggg cacfccgccga ggaggagcaa 15688 
gcgtgggagg gcgagctgcc ggagctgctg cctcctgccc agcgtgagtt gctcgcgcgc 15748 
tacaacgcca cccaggcgcc gcggcccagc ggacggctgg aggagggctt ctfccacccag 15808 
gcgcggctcc accccgagct ccccgcgctg ctcgcaccgg agcgcaccct gagctacggc 15868 
gagctggcaa ggcgagccca ggcgctggcc gcgcgcctac gcgagctgga ggtgcagcct 15928 
caggagttgg tggccattgc catgcacaag ggctgggagc aggccaccgc cgtgctcggc 15988 
gtgctgcagg cggccgcggc ctacctgccg ttggatccag agcagccgcc gctccggctc 16048 
caccagctcc tggaggaggg gcccgcccgc gtggtgctca cccagtcctc cttgctgcac 16108 
accgtgccct ggccgccggg tgtgcaggtg atcgccgtgg acgaactcga gcctgcaacc 16168 
gaggcccccc cgctaccgcc tcgcggtaca ccggagcacc tcgcctacgt catctatacc 1622 8 
tccggctcca ccggcaaacc caagggtgtg gccattgaac accgggccgc gctcaacacc 16288 
gtggtggacc tcaacacccg ctttggcgtc ggcccggagg atcgggtgct gggcctgtcc 16348 
gcgctcacct tcgacctgtc ggtgtacgac gtgctggggc tgctcggcgc gggcggagcg 16408 
ctggtgctgc ccgcggcgga ggcggagaag gatcccgccc actggtggga gcggctggtg 16468 
gctgggcggg tcacggtgtg gaactccacc ccggcgctga tgctgctgct ggtggagtac 16528 
gccgagcagc gcgggctgaa gctgcccgcg gcgctgcgcc tggtcatgct cagcggcgac 16588 
tggatccccg tggcgctacc ggatcgcatc cgcgcactcg gcagggacgt gcaggtggtg 16648 
agcctgggag gcgccaccga ggcttccatc tggtccattg cctaccccat tggccaggtg 16708 
gcaccgcagt ggaagagcat tccctatggc atgccgctgg ccaatcagcg attccacgtg 16768 
ctggacgggc ggctggaggc acggccctgg tgggtgccgg gcgagctgta cattggcggc 16828 
gaggggctgg cacgcgagta ctggagggat gagccgctca ccgcaacgcg cttcatccgt 16888 
cacccgcgca caggcgagcg gctgtaccgc accggggacc aagggcgcat gctgcccgaa 16948 
gggagcatcg agttcctcgg ccgggaggat ctgcaggtca aggtgcaggg cttccgcgtg 17008 
gs^ctggggg agattgaagc ggcgctggcc cagcacccgg cgctatcggc gagcgtggtg 170 68 
gtggcacgag gagagcctcg gggagtgcgg cggctggtgg cctacgcggt tcctcgctcg 17128 
ggrcagacgc ccgcggcagg cgagctgcgc cgctacctgg cggagcggct acccgcgtac 17188 
atggtgccct ccgccttcgt cctcctggag tccctgcccc gcagccgcaa cgggaagatc 17248 
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gcccgagatc agctccccga gccgcagcag acacagggac tggctgcaca ggccgcggcg 173 08 

gccgacccgc tcgtggagcg gctggcagcc ctggtgaaag aggcgctgag gctcgagcgc 17368 

gtggagcccc aggacagcct gctggacctg ggggcggact cggtggcgct catccgcctc 17428 

atcaaccgcc tggaggcaga gctgcagttc cgcccgcgac tggccgacat ctacgagaac 17488 

cccaccgtgc aggggctcgc cacgctccac caagagaaga caaagagcca gggagaggga 17548 

ggcgctccgc ggctcacggc gccccgctcc acgcttctgc ccgccgagga gtgggggcgc 17 608 

ttcaaggcca accgcccggg cctgcgccgc ttccccgatg gaacgcctga ggtggcactg 17668 

ccaggcagcg gcttggcgcc ggcccccgag gagctgacag ctctcgagcg ccgccgcagt 17728 

gtgcgcacct actccctcga gcccgtgagc catgagcagt tgggacggct gctggccccc 177 88 

ttgcgcgagt gggaggtgca gggctcgcgg cgttacctct acgcctccgc gggcgggctc 17 848 

tacccggtgc agctctacct ccacctcaag cccggccggg cgcgagggct cgagcccggc 17908 

acctggtatt acgatcccag cacccatcgg ctggtgctcc tgtccgccgg agccgggctg 17968 

gatcgccgca tccatgatcc gcaccagaac caggccatct tcgactcggc cgcgttctcc 18028 

ctcttcctca tcgcccgcat gggggccgtg gagccggtat acgccgaaca tgccctgcac 18088 

ttcgccacgc tggaggcagg gctgatgacc cagcttctcg atctgggcgc ggctcccagt 18148 

gggctggggc tgtgccatat cggcgacctg gacttcgccc aggcgcgggg gctcttccac 182 08 

ttggaggagg agcacgtgct gctgcacagc ctggtgggcg gagtactccc cactcggggg 18268 

caggaggcgg cctcggtgcc cgctgaggga ggaacggagg cgcggcaact ggcgcagctc 18328 

ctgcaacagg tgaagacgct cacgcccgag gccgcgcgag cgctgctcga ggcccgccgc 18388 

ggcagcaagg ggagaccac atg agt aag ctg cac gag gag ctg gag age 18437 

Met Ser Lys Leu His Glu Glu Leu Glu Ser 
2630 2635 

ctg gcg ccg gag cag egg gag eta etc gcg gee ctg atg aaa gag 18482 
Leu Ala Pro Glu Gin Arg Glu Leu Leu Ala Ala Leu Met Lys Glu 
2640 2645 2650 

cag ggc ctg gac gag ggc gca ttg ctg atg ccc gtg gag cgc aag 18527 
Gin Gly Leu Asp Glu Gly Ala Leu Leu Met Pro Val Glu Arg Lys 
2655 2660 2665 

ccc gag ggg ctg ccg etc teg tec gec cag cag cgc atg tgg ttc 18572 
Pro Glu Gly Leu Pro Leu Ser Ser Ala Gin Gin Arg Met Trp Phe 
2670 2675 2680 



ctg caa cag etc agg cct acg age ccc ttc tac aac gtg cac gcg 
Leu Gin Gin Leu Arg Pro Thr Ser Pro Phe Tyr Asn Val His Ala 
2685 2690 2695 

gcg etc egg etc acg gga aag ctg aag gtg gag tgc ctg gtg cac 
Ala Leu Arg Leu Thr Gly Lys Leu Lys Val Glu Cys Leu Val His 
2700 2705 2710 
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age ctg aat gag ttc gtc egg cgt cac gag 
Ser Leu Asn Glu Phe Val Arg Arg His Glu 
2715 2720 



ccg ctg cgc acg gtg 
Pro Leu Arg Thr Val 
2725 



ttc ccc tec gee gga ggt caa ccc ctg cag cgc ate etc gcg ccc 
Phe Pro Ser Ala Gly Gly Gin Pro Leu Gin Arg He Leu Ala Pro 
2730 2735 , 2740 



gca ccg get gcg etc 
Ala Pro Ala Ala Leu 
2745 



gag cag cgc gac ctg age ggc gtg ccc gec 
Glu Gin Arg Asp Leu Ser Gly Val Pro Ala 
2750 2755 
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cag gag cga gag gee gag gtg tac cgc gcg gtg gag cac gca gcg 
Gin Glu Arg Glu Ala Glu Val Tyr Arg Ala Val Glu His Ala Ala 
2760 2765 2770 

etc gee tec ttt gat ctg gag cga gag ccg ccg tgc cgc ttc etc 
Leu Ala Ser Phe Asp Leu Glu Arg Glu Pro Pro Cys Arg Phe Leu 
2775 2780 2785 

ctg gtg cga gtg gag ccg gag gaa cac gtg ctg gtg ttc gec acg 
Leu Val Arg Val Glu Pro Glu Glu His Val Leu Val Phe Ala Thr 
2790 2795 2800 

cac cac ate gca gcg gat ggc tgg teg etc ggc gtc ttc gtg cag 
His His He Ala Ala Asp Gly Trp Ser Leu Gly Val Phe Val Gin 
2805 2810 2815 

gag ctg tgc gcg ctg tac acg get get gta cac gcg gag cca ccg 
Glu Leu Cys Ala Leu Tyr Thr Ala Ala Val His Ala Glu Pro Pro 
2820 2825 2830 

gcg ctg cca ccg ctg egg ctg cag tac gcg gac ttc gee gcg tgg 
Ala Leu Pro Pro Leu Arg Leu Gin Tyr Ala Asp Phe Ala Ala Trp 
2835 2840 2845 

gag egg age egg etc aag ggt ggg cgc gag cgc gag ctg etc gag 
Glu Arg Ser Arg Leu Lys Gly Gly Arg Glu Arg Glu Leu Leu Glu 
2850 2855 2860 

tac tgg cag gag cag ctg gca ggg ctg ccg gac ctg age acc ctg 
Tvr Trp Gin Glu Gin Leu Ala Gly Leu Pro Asp Leu Ser Thr Leu 
2865 2870 2875 

ccg cca gag egg ccc agg ccc ccg ctg tec aag cag gag ggc gee 
Pro Pro Glu Arg Pro Arg Pro Pro Leu Ser Lys Gin Glu Gly Ala 
2880 2885 2890 

acg ttc gaa ttc gee ctg cca cca cac cag gtg cag gec ctg cgc 
Thr Phe Glu Phe Ala Leu Pro Pro His Gin Val Gin Ala Leu Arg 
2895 2900 2905 

tea ctt get caa gcg egg cgc acc teg etc ttc tec gtg ctg etc 
Ser Leu Ala Gin Ala Arg Arg Thr Ser Leu Phe Ser Val Leu Leu 
2910 2915 2920 

gec gee ttc caa tgg ctg ctg gec egg tgc gee ggc cag gac gac 
Ala Ala Phe Gin Trp Leu Leu Ala Arg Cys Ala Gly Gin Asp Asp 
2925 2930 2935 

gtg gee ctg ggc atg ccc ate gec aac cgc aac cgc aag gag ctg 
Val Ala Leu Gly Met Pro He Ala Asn Arg Asn Arg Lys Glu Leu 
2940 2945 2950 



18842 



18887 



18932 



18977 



19022 



19067 



19112 



19157 



19202 



19247 



19292 



19337 



19382 
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gaa ggc etc ate ggc tgc ttc gec agt acc ctg gtg ctg cgc gcg 
Glu Gly Leu He Gly Cys Phe Ala Ser Thr Leu Val Leu Arg Ala 
2955 2960 2965 



19427 



aag ccc tec gee teg acc gee ttc acc tea tgg etc gee cag gtg 
Lys Pro Ser Ala Ser Thr Ala Phe Thr Ser Trp Leu Ala Gin Val 
2970 2975 2980 



19472 



tec gag cag etc cac ggc gee etc gag cac cag gag gtg cct ttc 
Ser Glu Gin Leu His Gly Ala Leu Glu His Gin Glu Val Pro Phe 
2985 2990 2995 



19517 



gaa cga ttg gtg gag gtg etc cag ccc egg cgc cgc atg gat egg 
Glu Arg Leu Val Glu Val Leu Gin Pro Arg Arg Arg Met Asp Arg 
3000 3005 3010 



19562 



cac ccg etc ttc cag ate ttc etc gee atg cag cag cac ccg ctg 
His Pro Leu Phe Gin He Phe Leu Ala Met Gin Gin His Pro Leu 
3015 3020 3025 



19607 



egg cgc gee gag ctg cca ggc ctg ctg etc teg gag ttc ccc ctt 
Arg Arg Ala Glu Leu Pro Gly Leu Leu Leu Ser Glu Phe Pro Leu 
3030 3035 3040 



19652 



cgc age cgc gtg gec cgc ttc gat ctg gag ttc cac etc tgg gag 
Arg Ser Arg Val Ala Arg Phe Asp Leu Glu Phe His Leu Trp Glu 
3045 3050 3055 



19697 



tct gec cag ggc gtc gag ggc acc etc ate tac gac gtg gac ctg 
Ser Ala Gin Gly Val Glu Gly Thr Leu He Tyr Asp Val Asp Leu 
3060 3065 3070 



19742 



tat ggc cag gag ggc gta gcg caa ctg gtg egg cgc tac gtg age 
Tyr Gly Gin Glu Gly Val Ala Gin Leu Val Arg Arg Tyr Val Ser 
3075 3080 3085 



19787 



ctg etc gaa gee gtg 
Leu Leu Glu Ala Val 
3090 



gcg gcg gac ccg acg egg aca ctg ggg gag 
Ala Ala Asp Pro Thr Arg Thr Leu Gly Glu 
3095 3100 



19832 



ctg gcg ggt gag acg ctg gee ccg ccc gta egg aag cag gtt etc 
Leu Ala Gly Glu Thr Leu Ala Pro Pro Val Arg Lys Gin Val Leu 
3105 3110 3115 



19877 



gee ctg teg gag tgt ccg ccc etc cca cgc 
Ala Leu Ser Glu Cys Pro Pro Leu Pro Arg 
3120 3125 



ccg age aca gee cct 
Pro Ser Thr Ala Pro 
3130 



19922 



egg aca ctg gee gag 
Arg Thr Leu Ala Glu 
3135 



gca ctt eta cag acc gee gag cgc ttc ccc 
Ala Leu Leu Gin Thr Ala Glu Arg Phe Pro 
3140 3145 



19967 



acg gee acg gtc age ttc gtc cag gca cag ggc tct tgc acg gee 
Thr Ala Thr Val Ser Phe Val Gin Ala Gin Gly Ser Cys Thr Ala 
3150 3155 3160 



20012 



tgg act ctg cca gag 
Trp Thr Leu Pro Glu 
3165 



ctg gtg gag egg gec egg cgc ctg cag gee 
Leu Val Glu Arg Ala Arg Arg Leu Gin Ala 
3170 3175 



20057 



ggc ttg egg cag tgg gga ttg egg ccg ggc gac agt ctg gtg ctg 
Gly Leu Arg Gin Trp Gly Leu Arg Pro Gly Asp Ser Leu Val Leu 



20102 
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gtg ctg ggg agg gag 
Val Leu Gly Arg Glu 
3195 

gtg ctg gcg gga gtg 
Val Leu Ala Gly Val 
3210 

aga gcg gag gcg agt 
Arg Ala Glu Ala Ser 
3225 

cag ctg ctg ggt ggg 
Gin Leu Leu Gly Gly 
3240 

ccc gat ctt gcc agg 
Pro Asp Leu Ala Arg 
3255 

ctg ggg gcc gtg gag 
Leu Gly Ala Val Glu 
3270 

ctg ccc cca ggg cgc 
Leu Pro Pro Gly Arg 
3285 



gaa gag 
Glu Glu 



gcg ccc 
Ala Pro 



ccc gcg 
Pro Ala 



ccc egg 
Pro Arg 



cag ctt 
Gin Leu 



gag ctg 
Glu Leu 



atg gat 
Met Asp 



teg 
Ser 



aac 
Asn 



cag 
Gin 



ggc 

Gly 



cac 
His 



caa 



gcg 
Ala 



val 



ggc acc 
Gly Thr 



ctg ctg 
Leu Leu 



ccg etc 
Pro Leu 



gcc ctg 
Ala Leu 



acc ttc 
Thr Phe 



tgg ctg 
•Trn Leu 



ccc age 
Pro Ser 



Q ■"- v y a C 
Glu Asp 



acc ggc 
Thr Gly 
3300 

gtg cgc 
Val Arg 
3315 

gag cgc 
Glu Arg 
3330 

tec gag 
Ser Glu 
3345 

cac get 
His Ala 
3360 

gag tgg 
Glu Trp 
3375 

ttc gcc 
Phe Ala 
3390 

egg cgc 
Arg Arg 
3405 



aag gcc 
Lys Ala 



ctg gag 
Leu Glu 
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3185 

acg gta gag 
Thr Val Glu 
3200 

ctg gtg eta 
Leu Val Leu 
3215 

etc tea cgt 
Leu Ser Arg 
3230 

gtc etc acc 
Val Leu Thr 
3245 

cag gtc teg 
Gin Val Ser 
3260 

aga gcc acg 
Arg Ala Thr 
3275 

gac gtg gcc 
Asp Val Ala 
3290 

aag tgc gcc 
Lys Cys Ala 
3305 

gcc acc aac 
Ala Thr Asn 
3320 
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gca etc tgg gcc 
Ala Leu Trp Ala 



ccg gcg ccg ccc 
Pro Ala Pro Pro 



ctg egg cat gcc 
Leu Arg His Ala 



egg cag gag atg 
Arg Gin Glu Met 



ccc acc gcc gac 
Pro Thr Ala Asp 



ggt ggc gag gcg 
Gly Gly Glu Ala 



ctg etc aac etc 
Leu Leu Asn Leu 



atg etc acg cac 
Met Leu Thr His 



gtc gtc tac gag 
Val Val Tyr Glu 



ggc etc 
Gly Leu 



tac cac 
Tyr His 



ccc acc 
Pro Thr 



etc gag 
Leu Glu 



tac age 
Tyr Ser 



tgg age 
Trp Ser 



gtc tgg ttg 
Val Trp Leu 
3335 



cag ctg cac aac 
Gin Leu His Asn 



3190 

tgc 
Cys 
3205 

gcg 
Ala 
3220 

egg 
Arg 
3235 

ctg 
Leu 
3250 

ate 
lie 
3265 

ccc 
Pro 
3280 

acc 
Thr 
3295 

cgc 
Arg 
3310 

tec 
Ser 
3325 

att 
lie 
3340 



etc cgc ccg 
Leu Arg Pro 
3350 

gag gag gtg 
Glu Glu Val 
3365 

egg tat ggc 
Arg Tyr Gly 
3380 

cac ctg ctg 
His Leu Leu 
3395 

ctg gga ggc 
Leu Gly Gly 
3410 



etc tgc gcc ggc 
Leu Cys Ala Gly 



etc gcc gag ccc 
Leu Ala Glu Pro 



att gca cag acg 
He Ala Gin Thr 



gag egg ctg cgc 
Glu Arg Leu Arg 



gtg cgc gtg ctg 
Val Arg Val Leu 



atg 
Met 
3355 

ctg 
Leu 
3370 

tgg 

Trp 
3385 

aag 
Lys 
3400 

etc 
Leu 
3415 



age gcc gga gag cag ate tec gcg ccc atg gtg gag gag ctg atg 



20147 



20192 



20237 



20282 



20327 



20372 



20417 



20462 



20507 



20552 



20597 



20642 



20687 



20732 



20777 



20822 
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Ser Ala Gly Glu Gin lie Ser Ala Pro Met Val Glu Glu Leu Met 
3420 3425 3430 
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egg agg ttg get ccc 
Arg Arg Leu Ala Pro 
3435 



teg gga gtg egg gag gat gec ttc gtc get 
Ser Gly Val Arg Glu Asp Ala Phe Val Ala 
3440 3445 



20867 



gee tgg gga atg ace gag acg get teg ggg gtg ace tat gcg egg 
Ala Trp Gly Met Thr Glu Thr Ala Ser Gly Val Thr Tyr Ala Arg 
3450 3455 3460 



20912 



egg ccc gga acg ccg ccc cgc atg cac acc ctg gag cgc gcg age 
Arg Pro Gly Thr Pro Pro Arg Met His Thr Leu Glu Arg Ala Ser 
3465 3470 3475 



20957 



ttg age ggt ccg ctt egg cac gcg gca ccc get teg ccc acc gcg 
Leu Ser Gly Pro Leu Arg His Ala Ala Pro Ala Ser Pro Thr Ala 
3480 3485 3490 



21002 



ctg egg ctg atg gat gta gga gee ccc att gee ggc acc gec ctg 
Leu Arg Leu Met Asp Val Gly Ala Pro lie Ala Gly Thr Ala Leu 
3495 3500 3505 



21047 



egg gtg gtg gac gcg age ggc gag ctg etc 
Arg Val Val Asp Ala Ser Gly Glu Leu Leu 
3510 3515 



tec gag gag tgc gtg 
Ser Glu Glu Cys Val 
3520 



21092 



ggg cgc att caa gtg 
Gly Arg lie Gin Val 
3525 



cgc ggc gag atg ate tct ccc ggc tac tac 
Arg Gly Glu Met He Ser Pro Gly Tyr Tyr 
3530 3535 



21137 



ggg gac ccg aag gee 
Gly Asp Pro Lys Ala 
3540 



teg gcg gcg ctg etc 
Ser Ala Ala Leu Leu 
3545 



acc gee gac ggc tgg 
Thr Ala Asp Gly Trp 
3550 



21182 



ctg gag aca ggc gat ctg ggc ttc etc tec gag gga gcg ctg acc 
Leu Glu Thr Gly Asp Leu Gly Phe Leu Ser Glu Gly Ala Leu Thr 
3555 3560 3565 



21227 



att acc ggc cgc gee aag gat ctg gtc ate ate cac ggc acc aac 
He Thr Gly Arg Ala Lys Asp Leu Val He He His Gly Thr Asn 
3570 3575 3580 



21272 



ttc tec tgc tac gag 
Phe Ser Cys Tyr Glu 
3585 



ate gag tec gcg gtg gag cag gtg gag gga 
He Glu Ser Ala Val Glu Gin Val Glu Gly 
3590 3595 



21317 



gtg gcg ccc tec teg gcg gcg gcg gcg gcg gtg egg atg etc gag 
Val Ala Pro Ser Ser Ala Ala Ala Ala Ala Val Arg Met Leu Glu 
3600 3605 3610 



21362 



ggg age agg gag gag ctg gcg gtc ttc ttc gtg ccc acg gag gga 
Gly Ser Arg Glu Glu Leu Ala Val Phe Phe Val Pro Thr Glu Gly 
3615 3620 3625 



21407 



etc get ccg caa ccc cct gee tea etc ctg teg cgc ate cgc cag 
Leu Ala Pro Gin Pro Pro Ala Ser Leu Leu Ser Arg He Arg Gin 
3630 3635 3640 



21452 



cag gtg etc gag cag gtg ggg gtg C gg ate gat cat etc ate ccg 
Gin Val Leu Glu Gin Val Gly Val Arg He Asp His Leu He Pro 
3645 3650 3655 



21497 
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etc gag cct cac cag etc cca cgc acg gag gga ggc aag ctg cgc 21542 
Leu Glu Pro His Gin Leu Pro Arg Thr Glu Gly Gly Lys Leu Arg 
3660 3665 3670 

cgc teg gag ctg agg gcg cgc ttc gaa gca ggt gag ctg agg gcg 21587 
Arg Ser Glu Leu Arg Ala Arg Phe Glu Ala Gly Glu Leu Arg Ala 
3675 3680 3685 

ccg caa ccc get ccg gtc ccg age ccc teg cgc ccg ctg gag cag 21632 
Pro Gin Pro Ala Pro Val Pro Ser Pro Ser Arg Pro Leu Glu Gin 
3690 3695 3700 

etc ate gee tec gtc tgg gcg gag gtg etc gag cac cag gac ate 21677 
Leu lie Ala Ser Val Trp Ala Glu Val Leu Glu His Gin Asp lie 
3705 3710 3715 

gcg ccc gag gee age ttc ttc gac ctg ggc ggc aac tec ate etc 21722 
Ala Pro Glu Ala Ser Phe Phe Asp Leu Gly Gly Asn Ser lie Leu 
3720 3725 3730 

ctg gtg cgc gtg gag egg gca ctg cgc gcg egg etc ggc ctg gag 217 67 

Leu Val Arg Val Glu Arg Ala Leu Arg Ala Arg Leu Gly Leu Glu 
3735 3740 3745 

ctg acg etc atg gac ctg ttc gee tac ccc acg gtg cac teg ctg 21812 
Leu Thr Leu Met Asp Leu Phe Ala Tyr Pro Thr Val His Ser Leu 
3750 3755 3760 

gcg gac tac ctg gag cct cga gee gee cag ctg cca gca cag gee 21857 
Ala Asp Tyr Leu Glu Pro Arg Ala Ala Gin Leu Pro Ala Gin Ala 
3765 3770 3775 

tec cct ccc acg cag gee gag cgc egg egg ggg atg cgc ggc gcg 21902 
Ser Pro Pro Thr Gin Ala Glu Arg Arg Arg Gly Met Arg Gly Ala 
3780 3785 3790 



gcg ctg gag cag aga cgc act cgc cgc cag gca caa egg gac gag 
Ala Leu Glu Gin Arg Arg Thr Arg Arg Gin Ala Gin Arg Asp Glu 
3795 3800 3805 

gac tag 
Asp 



21947 



21953 



<210> 34 
<211> 2625 
<212> PRT 

<213> Angiococcus discif ormis 
<400> 34 

Leu Ser Ala Gly Ala Leu Leu Ala His Ala Ala Ser Leu Gly Val Arg 
15 10 15 



7. <=>v, Trn Val Glu Gly Glu Arg Leu Arg Phe Gin Ala Pro Pro Gly Val 
20 25 30 
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Met Thr Pro Glu Leu Gin Ser Arg Leu Gly Gly Ala Arg His Glu Leu 
35 40 45 



He Ala Leu Leu Arg Gin Leu Gin Pro Ser Ser Gin Gly Gly Ser Leu 
50 55 60 



Leu Ala Pro Val Ala Arg Asn Gly Arg Leu Ala Leu Ser Phe Ala Gin 
65 70 75 80 



Gin Arg Leu Trp Phe Gin Glu Gin Leu His Pro Glu Ala Pro Ala Asn 
85 90 95 



Asn Leu Thr Gly Ala Val Val Phe Thr Gly Pro Leu His Val Ala Ala 
100 105 110 



Leu Leu Gly Ala Val Ala Ala Leu Val Arg Arg His Glu Ala Leu Arg 
115 120 125, 



Thr Thr Leu Gly Glu Glu Gly Gly Val Pro Tyr Ser Leu He Gly Glu 
130 135 140 



Pro Trp Gin Pro Ala Leu Glu Val Glu Ala Leu Pro Gly Ala Thr Val 
145 ~ 150 155 160 



Gly Glu Arg Leu Glu Gin Ala Arg Glu Val Ala Leu Ala Glu Ser Arg 
165 170 175 



Arg Arg Phe Ala Leu Glu Thr Glu Pro His Leu Arg Val Arg Leu Leu 
180 185 190 



Arg Leu Ala Glu Gin Gin His Val Leu Val Leu Ser Leu His His He 
195 200 205 



Ala Ala Asp Gly Val Gly Leu Gin Val Leu Glu Gin Glu Leu Ala Ala 
210 ^ 215 220 



Leu Tyr Gly Ala Leu Ser Ala Gly Ala Glu Pro Arg Leu Pro Pro Leu 
225 230 235 240 



Pro Leu Gin Val Ala Asp Leu Ala Asp Trp Gin Arg Arg Trp Val Glu 
245 250 255 



Gly Glu Glu Tyr Gin Val Gin Leu Ala Tyr Trp Arg Arg Gin Leu Ala 
260 265 270 



Gly Leu Thr Pro Leu Glu Val Pro Gly Asp His Pro Arg Pro Arg He 
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275 280 285 



Pro Ser Met Arg Gly Ala Glu Val Arg Ala Pro Leu Leu Ser Ala Pro 
290 295 300 



Gin Ala Gin Val Leu Arg Ala Leu Gly Gin Gly Glu Gly Ala Thr Leu 
305 310 315 320 



Tyr Met Thr Leu Leu Ala Ala Leu Gly Val Leu Leu Gin Arg Trp Thr 
325 330 335 



Gly Gin His Asp Met Ala Val Gly Ser Ala Ala Ala Asn Arg Asn Arg 
340 345 350 



Pro Gly Leu Glu Gly lie Leu Gly Phe Leu Leu Asn He Val Leu Leu 
355 360 365 



Arg Leu Asp Leu Arg Gly Arg Pro Arg Phe Arg Glu Leu Leu Arg Gin 
370 375 380 



Ala Arg Arg Val Cys Val Glu Ala Tyr Ala His Gin Glu Leu Pro Phe 
385 390 395 400 



Glu His Leu Val Glu Ala Leu Gin Pro Gly Ser Glu Arg Gly Asp Ser 
405 410 415 



Ser Leu Tyr Arg Val Ala Leu Ala Val Ser Asp Thr Pro Trp Met Pro 
420 425 430 



Gly His Gly Leu Lys Leu Glu Gly Val Gin Ala Gin Pro Leu Asp Phe 
435 440 445 



Pro Arg Gly Val Leu Asp Leu Asp Leu His Leu Trp Val Tyr Asp Thr 
450 455 460 



Gly Glu Gly Leu Thr Gly Arg Leu Glu Tyr Ala Val Asp Leu Tyr Glu 
465 470 475 480 



Glu Pro Thr Ala Arg Arg Leu Leu Glu Gly Phe Arg Gin Val Leu Glu 
485 490 495 



Ala Val Val Glu Ala Pro Asp Arg Pro Val Pro Glu Leu Pro Val Leu 
500 505 510 



Gly Glu Gin Glu Arg His Gin Val Leu Ser Gly Trp Asn Arg Thr Gin 
515 520 525 
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Arg Pro Tyr Pro Arg Glu Ala Ser Val His Gly Leu Phe Gin Gin Arg 
530 " 535 540 

Ala Leu Gin Ala Pro Arg Ala Val Ala Val Val Tyr Gly Glu Arg Ser 
545 550 555 560 

Leu Thr Tyr Gly Glu Leu Ala Glu Arg Ala Arg Gly Leu Ala Gin Gly 
565 570 575 

Leu Val Ala Arg Gly Val Arg Arg Gly Asp Leu Val Ala Leu Arg Leu 
580 585 590 

Glu Arg Ser Pro Glu Gin Val Glu Ser Met Leu Ala Val Leu Gin Ala 
595 600 605 

Gly Ala Ala Tyr Val Pro Leu Asp Pro Ser Tyr Pro Val Gin Arg Gin 
610 615 620 

Glu Phe Met Leu Gin Asp Ser Gly Ala Arg Leu Leu Val His Ser Gly 
625 630 635 640 

Pro Leu Pro Phe Ala Pro Gin Gly Cys Ala Thr Leu Asp Leu Gin Ala 
645 650 655 

Trp His Pro Ala Pro Ser Asp Gly Gly Glu Pro Leu Pro Gin Cys Ser 
660 665 670 

Gly Glu Asp Leu Ala Tyr Val lie Tyr Thr Ser Gly Ser Thr Gly Gin 
675 680 685 

Pro Lys Gly Val Ala Val Cys His Arg Ala Met Thr Arg Leu Val Cys 
690 695 700 

Asn Thr Asp Tyr Val Gin Leu Gly Pro Glu Asp Arg Val Ala Gin Ala 
705 710 715 720 

Ser Asn Ala Ser Phe Asp Ala Ala Thr Phe Glu Val Trp Gly Ala Leu 
725 730 735 

Leu Asn Gly Ala Arg Leu Val Gly Leu Ala Thr Glu Glu Ala He Gin 
740 745 750 

Ala Arg Arg Leu Ala Glu Val Leu Arg Glu Gin Arg lie Ser Val Leu 
755 760 765 



Phe Val Thr Thr Ala Leu Phe Asn His Val Ala Arg Glu Gin Pro Gin 
770 775 780 
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Ala Phe Ser Thr Leu Arg Tyr Leu Leu Phe Gly Gly Glu Ala Val Asp 

785 790 795 800 

Ala Ser Ser Val Arg Arg Val Leu Lys Gin Gly Ala Pro Gly His Leu 
805 810 815 

Leu His Val Tyr Gly Pro Thr Glu Asn Thr Thr Phe Ser Thr Ala Trp 
820 825 830 

Arg Val Glu His Leu Ala Glu Gin Ala His Thr Val Pro Met Gly His 
835 840 845 

Pro lie Ala Asn Ser Arg Leu His Val Leu Asp Glu Ala Leu Gin Pro 
850 855 860 

Val Pro Val Gly Ala Met Gly Glu Val Tyr Leu Gly Gly Asp Gly Leu 
865 870 875 880 

Ala Leu Gly Tyr Trp Arg His Pro Glu Ala Thr Ala Glu Arg Phe Val 
885 890 895 

Pro Asp Pro His Gly Leu Glu Pro Gly Gly Arg Leu Tyr Arg Thr Gly 
900 905 910 

Asp Leu Ala Arg Arg Gin Ala Asp Gly Ala Val Val Phe Ala Gly Arg 
915 920 925 

Val Asp Arg Gin Val Lys Leu Arg Gly Phe Arg Val Glu Pro Ala Glu 
930 935 • 940 

lie Glu Ser His Leu Cys Glu His Ser Glu Val Ser Ala Ala Val Val 
945 950 955 960 

Glu Leu Arg Gly Glu Gly Ala Leu Arg Arg Leu Val Ala Tyr Val Val 
965 970 975 

Pro Arg Ala Gly Gly Arg Pro Gly Ala Glu Glu Leu Arg Thr Phe Leu 
980 985 990 

Arg Thr Arg Leu Pro Glu Tyr Met Leu Pro Ala Ser Phe Ser Leu Leu 
995 1000 1005 

Clu Ala Leu Pro Leu Thr Pro Asn Gly Lys Val Asp Arg Ser Ala 
•010 1015 1020 



r.^, qi u ser Phe Glu Glu Ala Ser Arg Glu Gin Ala Pro Val 

1025 1030 1035 
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Val Pro Pro Arg Gly Pro Val Glu Ala Leu Leu Val Asp lie Trp 
1040 ^ ^ 1045 1050 

Arg Glu Val Leu Gly Thr Gin Arg Val Ser Val His Asp Asp Phe 
1055 1060 1065 

Phe Asp Leu Gly Gly His Ser Leu Leu Ala Thr Arg Val Val Ser 
1070 1075 1080 

Arg Leu Arg Glu Ala Leu Gin Val Glu Leu Pro Leu Arg Thr Leu 
1085 1090 1095 

Phe Glu Ala Pro Gin Leu Ser Ala Leu Ala Ala Gin Val Glu Val 
1100 1105 1110 

Leu Leu Gly His Arg Gin Leu Arg Pro Pro Pro Leu Val Pro Ala 
1115 1120 1125 

Val Arg Pro Pro Glu Leu Pro Leu Ser Phe Ala Gin Gin Arg Leu 
1130 1135 1140 

Trp Phe Leu Gin Gin Leu Ala Pro Gin Ser Thr Ala Tyr Gin lie 
1145 1150 1155 

Leu Asp Ala Trp His Val Arg Gly Arg Val Asp Val Gly Ala Leu 
1160 1165 1170 

Glu Arg Ala Leu Glu Gin Leu Val Arg Arg His Glu Ala Leu Arg 
1175 ' 1180 1185 

Thr Thr Phe Glu Pro Gly Gly Asp Gly Val Pro Arg Gin Arg lie 
1190 1195 1200 

His Ala Pro Ala Pro Val Pro Leu Arg Gin Val Asp Leu Arg Ser 
1205 1210 1215 

His Gly He Ala Ala Arg Glu Glu Ala Leu Arg Trp Met Arg Glu 
1220 1225 1230 

Gin Ala Leu Arg Pro Leu Glu Leu Asp Lys Gly Pro Leu Leu Arg 
1235 ^ 1240 1245 

Val Ser Leu Leu Arg Leu Glu Glu Ala Gin Ser Leu Leu Phe Leu 
1250 1255 1260 



Glu Leu His His He Val Gly Asp Gly Trp Ser Leu Ser Val Trp 
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1265 1270 1275 

Ser Arg Glu Leu Ser His Leu Tyr Glu Ala Ala Leu His Gly Ala 
1280 1285 1290 

Glu Pro Gly Leu Ala Pro Leu Pro Val Gin Tyr Ala Asp Phe Ala 
1295 1300 1305 

Leu Trp Gin Arg Gly Trp Leu Gin Gly Pro Val Leu Arg Glu Glu 
1310 1315 1320 

Leu Thr Trp Trp Arg Glu Arg Leu Ala Arg Leu Ala Pro Leu Arg 
1325 1330 1335 

Leu Pro Ala Asp His Ala Arg Pro Glu Val Gin Arg Phe Asn Gly 
1340 1345 1350 

Ala Thr Tyr Arg Phe Thr Leu Pro Gly Val Arg Val Gin Ala Leu 
1355 1360 1365 

Arg Arg Leu Gly His Glu His Gly Ala Thr Leu Phe Met Val Leu 
1370 1375 1380 

Leu Ala Gly Phe Asn Ala Leu Leu Ala Arg Tyr Thr Gly Gin Thr 
1385 1390 ~ 1395 

Asp He Ala He Gly Ala Pro He Ala Asn Arg Thr Arg Gly Glu 
1400 1405 1410 

Val Glu Gly Leu He Gly Phe Phe Val Asn Thr Leu Val Leu Arg 
1415 1420 1425 

Thr Arg Leu Glu Gly Asn Pro Ser Phe Leu Glu Leu Leu Arg Arg 
1430 1435 1440 

Val Arg Glu Thr Thr Leu Glu Ala Tyr Ala His Gin Glu Leu Pro 
1445 1450 1455 

Phe Glu Arg Leu Val Glu Glu Leu Gin Pro Glu Arg Gin Ala Asn 
1450 1465 1470 

Gin Asn Pro Leu Val Gin Val Leu Leu Ala Leu Gin Asn Ala Pro 
1475 1480 1485 



Arg Giu Pro Leu Arg Leu Ala Gly Leu Glu Ala Glu His Leu Glu 
1490 1495 1500 
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Tyr Leu Val Ala Thr Thr Arg Phe Asp Leu Glu Leu His Leu Trp 
1505 1510 1515 

Glu Glu Glu Glu Gly Leu Ser Cys He Ala Val Tyr Asp Arg Ala 
1520 1525 1530 

Leu Tyr Gly Ala Gly Thr Val Glu Arg Leu Val Gly Ala Trp Cys 
1535 1540 1545 

Thr Leu Leu Glu Gly Val Ala Glu Leu Pro Ala Arg Arg Val Ala 
1550 1555 1560 

Glu Leu Pro Leu Val Pro Ala Arg Glu Leu Arg Gin Val Pro Pro 
1565 1570 1575 

Pro Ser Pro Ala Ala Ala He Glu Thr Ser He Gly Ala Arg Phe 
1580 1585 1590 

Ser Glu Val Ala Arg Arg Gin Pro Gly Ala Thr Ala Val Thr Gin 
1595 1600 1605 

Gly Gly Arg His Leu Thr Tyr Ala Glu Leu Glu Glu Arg Ser Glu 
1610 1615 1620 

Arg Leu Ala Arg Tyr Leu Ala Trp Leu Gly Val Arg Ala Gly Asp 
1625 1630 1635 

Arg Val Gly Leu Ala Thr Glu Arg Thr Leu Glu Arg He He Ser 
1640 1645 1650 

Leu Leu Gly He Leu Lys Ala Gly Ala Ala Tyr Val Pro Leu Asp 
1655 1660 1665 

Val Arg Gin Pro Ala Arg Arg Leu Ser Leu Leu Val Gin Ala Ala 
1670 1675 1680 

Gly Val Arg Thr Val He Ala Glu Glu Gin Ala Arg Thr Val Leu 
1685 1690 1695 

Ser Gly Leu Gly Gin Pro Leu Thr Leu Val Asp Ala Ala Gin Glu 
1700 1705 1710 

Pro Ala Ser Ala Gin Gin Val Pro Ala Leu Gly Pro Glu Arg Ser 
1715 1720 1725 



Leu Gly Gly Asp Met Leu Ala Tyr Val Leu Phe Thr Ser Gly Ser 
1730 1735 1740 
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Thr Gly Glu Pro Lys Gly Val Cys lie Pro His Arg Ala Val Leu 

1745 1750 1755 

Arg Leu lie His Glu Pro Ser Tyr Val Gin Leu Ser Pro Arg Glu 

1760 1765 1770 

Val Met Leu His Tyr Ala Pro Leu Glu Phe Asp Ala Ser Thr Phe 

1775 1780 1785 

Glu Val Trp Gly Ala Leu Leu Asn Gly Ala Arg Leu Val Leu Val 

1790 1795 1800 

Pro Pro Glu Gin Gin Ser Leu Glu Ser Leu Gly Gin Glu Leu Ser 

1805 1810 ,1815 

Thr Gin Gly Val Thr Val Leu Trp Leu Thr Ala Gly Leu Phe Arg 

1820 1825 1830 

Leu Met Val Glu Glu Gin Leu Lys Ser Leu Arg Gly Val Arg Gin 

1835 1840 1845 

Leu Leu Ala Gly Gly Asp Val Leu Pro Met Pro Gin Val Arg Arg 

1850 1855 1860 

Leu Arg Glu Ala Leu Pro Glu Cys Gin Leu lie Asn Gly Tyr Gly 

1865 1870 1875 

Pro Thr Glu Ser Cys Thr Phe Thr Cys Cys His Arg Val Gly Ser 

1880 1885 1890 

Pro Gin Glu Leu Gly Gly Ser Val Pro lie Gly Thr Pro lie Asp 

1895 1900 1905 

Leu Gly Trp Val Ser Val Val Asp Glu Arg Leu Gin Pro Val Pro 

1910 1915 1920 

Asp Gly Ala Pro Gly Glu Leu Leu Val Gly Gly Pro Gly Leu Ala 

1925 1930 1935 

Trp Gly Tyr Leu Gin His Pro Glu Leu Thr Ala Glu Arg Phe lie 

1940 1945 1950 

Pro Asp Pro Leu Ser Arg Thr Pro Gly Ala Arg Val Tyr Arg Thr 

1955 1960 1965 

Gly Asp Leu Val Arg Arg Arg Glu Asp Gly Thr Leu Glu Phe Leu 

1970 1975 ^ 1980 
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Gly Arg Val Asp His Gin Leu Lys Val Arg Gly Phe Arg lie Glu 
1985 1990 1995 



Pro Gly Glu Val Glu Ala Ala Val Leu Thr His Pro Ala Val Gin 
2000 2005 2010 



Ser Ala Val Val Val Gly Arg Glu Gly Pro Gly Gly Lys Glu Leu 
2015 2020 2025 



Val Cys Tyr Ala Val Pro Arg Val Glu Ser Ser Glu Gin Gly Ser 
2030 2035 2040 



Gin Gin Glu Gin Arg Leu Val His Glu Trp Glu Ser Val Phe Asp 
2045 2050 2055 



Gly His Met Tyr Arg Glu Ala Pro Val Gly Gly Glu Pro Thr Phe 
2060 2065 2070 



Asn lie Val Gly Trp Lys Ser Ser Tyr Thr Gly Gin Pro Val Ala 
2075 2080 2085 



Val Glu Glu Met Arg Asp Trp Leu Arg His Arg Val Glu Arg Val 
2090 . 2095 2100 



Arg Gly Leu Arg Pro Arg Arg lie Leu Glu Val Gly Cys Gly Thr 
2105 2110 2115 



Gly Leu Met Leu Phe Ala Leu Leu Pro His Cys Glu Arg Tyr Val 
2120 2125 2130 



Gly Thr Asp Phe Ser Pro Ala Ala Leu Asp Tyr Val Arg Arg Tyr 
2135 2140 2145 



Leu Pro Pro Glu His Pro Gly Arg Val Glu Leu Leu His Arg Thr 
2150 2155 2160 



Ala Asp Glu Trp Ser Gly Val Ala Ala Gly Ser Phe Asp Ala Val 
2165 2170 2175 



Leu Leu Asn Ser Val Val Gin Tyr Phe Pro Ser Gin Glu Tyr Leu 
2180 2185 2190 



Arg Gin Val Leu Ala Arg Cys Val Glu Ala Val Glu Asp Gly Gly 
2195 2200 2205 



Phe Val Phe Val Gly Asp Val Arg Ser Leu Pro Leu Leu Glu Ser 
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2210 2215 2220 

Phe His Ala Ser Val Glu Leu Glu Arg Ala Ala Pro Ser Met Pro 
2225 2230 2235 

Leu Glu Ala Trp Arg Glu Arg Val Arg Arg Ala Val Leu Glu Asp 
2240 2245 2250 

Asn Glu Leu Val Val Asp Pro Ala Leu Phe Val Ala Leu Ala His 
2255 2260 2265 

Gin His Pro Arg Val Ser His Val Asp lie Glu Leu Thr Arg Gly 
2270 2275 2280 

Thr His Pro Asn Glu Met Ala Arg Phe Arg Tyr Asn Ala Val Leu 
2285 2290 2295 

His He Gly Pro Arg Thr Pro Pro Pro Ala Ser Glu Val Pro Trp 
2300 2305 2310 

Val Asp Trp Ser Thr Gin Gly Leu Ser Leu Asp Ala Leu Arg Ala 
2315 2320 2325 

Arg Leu Arg Gin Gly Pro Pro Gly Pro Leu Gly Val Ala Gly He 
2330 2335 2340 

Pro Asn Ala Arg Val Leu Pro Ala Val Arg Ala Ala Glu Ala Leu 
2345 2350 2355 

Gly Ser Thr Gly Ser Ala Arg Arg Val Glu Glu Leu Arg Arg Arg 
2360 2365 2370 

Leu Ser Gin Pro Gly Thr Gly Ala Gin Asp Pro Asp Pro Phe Trp 
2375 2380 2385 

Gin Leu Ala Glu Ser Leu Gly Tyr Thr Ala Ala Val Ser Trp Ser 
2390 2395 2400 

Pro Gly Arg Arg Asp Gly Ala Phe Asp Val Leu Phe Leu Pro Ala 
2405 2410 2415 

Thr Pro Gly Met His Pro Arg Trp Leu Gly Pro Thr Pro Leu Asn 
2420 2425 2430 



: rr Thr Pro Pro Pro Ala Ser Ala Arg Leu Ser Ser Glu Pro Arg 
2435 2440 2445 
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Arg Ala Ser Leu Ser Leu Arg Leu Gly Ser Ala Leu Arg Ala His 

2450 2455 2460 

Leu Gin Thr His Leu Pro Asp Phe Met Val Pro Ser Arg Phe Val 

2465 2470 2475 

Val Leu Gin Ser Leu Pro Leu Thr Pro Asn Gly Lys Val Asp Arg 

2480 2485 2490 

Ala Ala Leu Pro Val Pro Asp Ser Arg Arg Leu Glu Ser Ala Pro 

2495 2500 2505 

Leu Val Pro Pro Ser Asn Glu Leu Glu Arg Val Leu Ala Gin Val 

2510 2515 2520 

Trp Lys Glu Val Leu Gly Leu Glu Glu Val Ser Arg Glu Asp Asn 

2525 2530 2535 

Phe Phe Asp Val Gly Gly His Ser Leu Leu Leu Ala Gin Val Cys 

2540 2545 2550 

Ser Arg Leu Glu Ala Arg Leu Gly Arg Arg Leu Glu Leu Val Thr 

2555 2560 2565 

Leu Phe Arg Tyr Ser Ser He Ala Ala Leu Ala Glu His Leu Gin 

2570 2575 2580 

Ala Pro Gin Glu Leu Ala Ala Ala Glu Ala Gin Val Gin Arg Met 

2585 2590 2595 

Ala Thr Glu Arg Ala Leu Leu Gin Gin Gin Ala Ala Gin Arg Arg 

2600 2605 2610 

Arg Ala Gly Ser Lys Arg Gly Ala Pro Asn Asp Thr 

2615 2620 2625 

<210> 35 
<211> 1181 
<212> PRT 

<213> Angiococcus discif ormis 



<400> 35 



Met Ser Lys Leu His Glu Glu Leu Glu Ser Leu Ala Pro Glu Gin Arg 
15 10 15 
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Glu Leu Leu Ala Ala Leu Met Lys Glu Gin Gly Leu Asp Glu Gly Ala 
20 25 30 

Leu Leu Met Pro Val Glu Arg Lys Pro Glu Gly Leu Pro Leu Ser Ser 
35 40 ~' 45 

Ala Gin Gin Arg Met Trp Phe Leu Gin Gin Leu Arg Pro Thr Ser Pro 
50 55 60 

Phe Tyr Asn Val His Ala Ala Leu Arg Leu Thr Gly Lys Leu Lys Val 
65 70 75 80 

Glu Cys Leu Val His Ser Leu Asn Glu Phe Val Arg Arg His Glu Pro 
85 90 95 

Leu Arg Thr Val Phe Pro Ser Ala Gly Gly Gin Pro Leu Gin Arg lie 
100 105 110 

Leu Ala Pro Ala Pro Ala Ala Leu Glu Gin Arg Asp Leu Ser Gly Val 
115 120 125 

Pro Ala Gin Glu Arg Glu Ala Glu Val Tyr Arg Ala Val Glu His Ala 
130 135 140 

Ala Leu Ala Ser Phe Asp Leu Glu Arg Glu Pro Pro Cys Arg Phe Leu 
i45 150 155 160 

Leu Val Arg Val Glu Pro Glu Glu His Val Leu Val Phe Ala Thr His 
165 170 175 

His lie Ala Ala Asp Gly Trp Ser Leu Gly Val Phe Val Gin Glu Leu 
180 185 190 

Cys Ala Leu Tyr Thr Ala Ala Val His Ala Glu Pro Pro Ala Leu Pro 
195 200 205 

Pro Leu Arg Leu Gin Tyr Ala Asp Phe Ala Ala Trp Glu Arg Ser Arg 
210 215 220 

Leu Lys Gly Gly Arg Glu Arg Glu Leu Leu Glu Tyr Trp Gin Glu Gin 
225 230 235 240 

Leu Ala Gly Leu Pro Asp Leu Ser Thr Leu Pro Pro Glu Arg Pro Arg 
245 t 250 255 



Pro Trc Leu Ser Lys Gin Glu Gly Ala Thr Phe Glu Phe Ala Leu Pro 
260 265 270 
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Pro His Gin Val Gin Ala Leu Arg Ser Leu Ala Gin Ala Arg Arg Thr 
275 280 285 

Ser Leu Phe Ser Val Leu Leu Ala Ala Phe Gin Trp Leu Leu Ala Arg 
290 295 300 

Cys Ala Gly Gin Asp Asp Val Ala Leu Gly Met Pro lie Ala Asn Arg 
305 310 315 320 

Asn Arg Lys Glu Leu Glu Gly Leu lie Gly Cys Phe Ala Ser Thr Leu 
325 330 335 

Val Leu Arg Ala Lys Pro Ser Ala Ser Thr Ala Phe Thr Ser Trp Leu 
340 345 350 

Ala Gin Val Ser Glu Gin Leu His Gly Ala Leu Glu His Gin Glu Val 
355 360 365 

Pro Phe Glu Arg Leu Val Glu Val Leu Gin Pro Arg Arg Arg Met Asp 
370 375 380 

Arg His Pro Leu Phe Gin He Phe Leu Ala Met Gin Gin His Pro Leu 
385 390 395 400 

Arg Arg Ala Glu Leu Pro Gly Leu Leu Leu Ser Glu Phe Pro Leu Arg 
405 410 415 

Ser Arg Val Ala Arg Phe Asp Leu Glu Phe His Leu Trp Glu Ser Ala 
420 425 ~ 430 

Gin Gly Val Glu Gly Thr Leu He Tyr Asp Val Asp Leu Tyr Gly Gin 
435 440 445 

Glu Gly Val Ala Gin Leu Val Arg Arg Tyr Val Ser Leu Leu Glu Ala 
450 455 460 

Val Ala Ala Asp Pro Thr Arg Thr Leu Gly Glu Leu Ala Gly Glu Thr 
465 470 475 480 

Leu Ala Pro Pro Val Arg Lys Gin Val Leu Ala Leu Ser Glu Cys Pro 
485 490 495 

Pro Leu Pro Arg Pro Ser Thr Ala Pro Arg Thr Leu Ala Glu Ala Leu 
500 505 510 



Leu Gin Thr Ala Glu Arg Phe Pro Thr Ala Thr Val Ser Phe Val Gin 
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515 520 525 

Ala Gin Gly Ser Cys Thr Ala Trp Thr Leu Pro Glu Leu Val Glu Arg 
530 535 540 

Ala Arg Arg Leu Gin Ala Gly Leu Arg Gin Trp Gly Leu Arg Pro Gly 
545 550 555 560 

Asp Ser Leu Val Leu Val Leu Gly Arg Glu Glu Glu Thr Val Glu Ala 
565 570 575 

Leu Trp Ala Cys Val Leu Ala Gly Val Ala Pro Leu Val Leu Pro Ala 
580 585 590 

Pro Pro Ala Arg Ala Glu Ala Ser Pro Ala Leu Ser Arg Leu Arg His 
595 600 605 

Ala Arg Gin Leu Leu Gly Gly Pro Arg val Leu Thr Arg Gin Glu Met 
610 S15 620 

Leu Pro Asp Leu Ala Arg Gin Leu Gin Val Ser Pro Thr Ala Asp lie 
625 630 635 640 

Leu Gly Ala Val Glu Glu Leu Arg Ala Thr Gly Gly Glu Ala Pro Leu 
645 650 655 

Pro Pro Gly Arg Met Asp Asp Val Ala Leu Leu Asn Leu Thr Ser Gly 
660 665 670 

Thr Thr Gly Lys Ala Lys Cys Ala Met Leu Thr His Arg Asn Leu Leu 
675 680 685 

Val Arg Leu Glu Ala Thr Asn Val Val Tyr Glu Ser Gin Pro Leu Glu 
690 695 700 

Arg Gly Leu Val Trp Leu Gin Leu His Asn lie Gly Ala Leu Ser Glu 
705 710 715 ~ 720 

Tyr His Leu Arg Pro Leu Cys Ala Gly Met His Thr Phe His Ala Pro 
725 730 735 

Thr Glu Glu Val Leu Ala Glu Pro Leu Arg Trp Leu Glu Trp Leu Glu 
740 745 75Q 



Arg Tyr Gly lie Ala Gin Thr Trp Ala Pro Ser Phe Ala Tyr Ser His 
755 760 765 
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Leu Leu Glu Arg Leu Arg Lys Val Glu Asp Arg Arg Trp Ser Leu Gly 
770 775 780 

Gly Val Arg Val Leu Leu Ser Ala Gly Glu Gin lie Ser Ala Pro Met 
785 7 *0 795 800 

Val Glu Glu Leu Met Arg Arg Leu Ala Pro Ser Gly Val Arg Glu Asp 
805 810 815 

Ala Phe Val Ala Ala Trp Gly Met Thr Glu Thr Ala Ser Gly Val Thr 
820 825 830 

Tyr Ala Arg Arg Pro Gly Thr Pro Pro Arg Met His Thr Leu Glu Ara 
835 840 845 

Ala Ser Leu Ser Gly Pro Leu Arg His Ala Ala Pro Ala Ser Pro Thr 
850 855 860 

Ala Leu Arg Leu Met Asp Val Gly Ala Pro He Ala Gly Thr Ala Leu 
865 870 875 880 

Arg Val Val Asp Ala Ser Gly Glu Leu Leu Ser Glu Glu Cys Val Gly 
885 890 895 

Arg He Gin Val Arg Gly Glu Met He Ser Pro Gly Tyr Tyr Gly Aso 
900 905 910 

Pro Lys Ala Ser Ala Ala Leu Leu Thr Ala Asp Gly Trp Leu Glu Thr 
915 920 925 

Gly Asp Leu Gly Phe Leu Ser Glu Gly Ala Leu Thr He Thr Gly Arg 
930 935 94Q 

Ala Lys Asp Leu Val He He His Gly Thr Asn Phe Ser Cys Tyr Glu 
945 950 955 960 

He Glu Ser Ala Val Glu Gin Val Glu Gly Val Ala Pro Ser Ser Ala 
965 970 975 

Ala Ala Ala Ala Val Arg Met Leu Glu Gly Ser Arg Glu Glu Leu Ala 
980 985 990 

Val Phe Phe Val Pro Thr Glu Gly Leu Ala Pro Gin Pro Pro Ala Ser 
995 1000 1005 

LSU Ser ^ g Ile Arg Gln Gln Val Leu Glu Gin Val Gly Val 

1010 1015 1020 
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Ar3 ?i? B ASP H±S Leu Ile Pro Leu Glu Pro His Gin Leu Pro Arg 
1025 1030 1035 

Thr Glu Gly Gly Lys Leu Arg Arg Ser Glu Leu Arg Ala Arg Phe 
1040 1045 1050 



Glu Ala Gly Glu Leu Arg Ala Pro Gin Pro Ala Pro Val Pro Ser 
1055 1060 1065 

Pr ° ?«™ ** g Pr ° LeU Glu Gln Leu Ile Ala Ser Val Trp Ala Glu 
1070 1075 1080 

Val = G1U His Gln Asp Ile Ala Pro Glu Ala Ser Phe Phe Asp 
1085 1090 1095 

LSU ?Sfn Gly ASn SSr I1S LSU Leu Val ^ Val Glu Arg Ala Leu 
1100 1105 IHO 

Arg Ala Arg Leu Gly Leu Glu Leu Thr Leu Met Asp Leu Phe Ala 
H15 1120 1125 

^ ™ Thr Val His Ser Leu Ala As P L eu Glu Pro Arg Ala 

H30 1135 114Q 

Ala ?S C LSU Pr ° Ala Gln Ala Ser Pro Pro Thr Ala Glu Arg 

H45 1150 1155 

Arg fff „ GlY Met Arg Gly Ala Ala Leu Giu Glu Arg Arg Thr Arg 
H60 1165 1170 

Arg Gln Ala Gln Arg Asp Glu Asp 
H75 1180 

<210> 36 

<211> 2200 

<212> DNA 

<i'13> Angiococcus disciformis 



<220> 

<221> cps 

^ZZZ-' { 44 j . . (970) 

■» c j > 05j22 6 



WO 2004/022586 




CT/EP2003/009780 



<220> 

<221> CDS 

<222> (973) .. (2181) 

<223> ORF25 



<220> 

<221> misc_feature 

<222> (2195) .. (2195) 

<223> n=A, C, G or T 



<400> 36 

gggagcgcgc tcggcaaggg caggcgtctg ttagggtcta cct atg agg tta cgt 55 

Met Arg Leu Arg 
1 

^° 5 fc ? o CC °? C gtt CCC ggg Cta ttg ctg gtg ct ^ ct S tec atg gec 103 
Pro Met Pro His Val Pro Gly Leu Leu Leu Val Leu Leu Ser Met Ala 

5 10 15 20 

gca tgg gca gec gaa gcg cag acg gec ccc ggc cct tec ctt cgt ttc 151 
Ala Trp Ala Ala Glu Ala Gin Thr Ala Pro Gly Pro Ser Leu Arg Phe 
25 30 ~ 35 

cgc cgc gtg ccc etc act gga aag ccc gag gtg gtg egg etc gca acg 199 
Arg Arg Val Pro Leu Thr Gly Lys Pro Glu Val Val Arg Leu Ala Thr 
40 45 50 



ggc att ccc gcg acg etc aac ttc gac gcg gag ate aac ccg cag aaa 
Gly lie Pro Ala Thr Leu Asn Phe Asp Ala Glu lie Asn Pro Gin Lys 
55 60 65 

gtg egg ctg gat get ccg gee cgc gtc aag att ttg age acc ggg gag 
Val Arg Leu Asp Ala Pro Ala Arg Val Lys He Leu Ser Thr Gly Glu 
70 75 80 



247 



295 



aS lit £? ST r Ctg Ctt g f C gag ggC ffat ttg ggc gaa ggg ^ tg act 343 
Arg Ser Val Thr Leu Leu Ala Glu Gly Asp Leu Gly Glu Gly Val Thr 

85 90 95 " ioo 

ttg cgc gtg ccg ttc ctg act gag ccc tec etc gcg gaa ccc gee ttt 391 
Leu Arg val Pro Phe Leu Thr Glu Pro Ser Leu Ala Glu Pro Ala Phe 
105 H0 115 

r CtC ^ g gCt tCg gaC gtg gcg gat tcg ca S atg gtc tac 439 
Gin Leu Val Thr Ala Ser Asp Val Ala Asp Ser Gin Val Met Val Tyr 
120 125 130 

cgc age gec aac gcg ccg gag ttg atg caa gca egg ctg acg gag ttg 48 7 
Arg Ser Ala Asn Ala Pro Glu Leu Met Gin Ala Arg Leu Thr Glu Leu 
135 140 145 
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gag gcc cgc age gec gcg tgc gag gcg caa ctt tec acc gag cgc gag 535 
Glu Ala Arg Ser Ala Ala Cys Glu Ala Gin Leu Ser Thr Glu Arg Glu 
150 155 160 

cgc age gag gcc acg ggc ccc gcc gcc tgg gtc etc tec egg caa gtt 583 
Arg Ser Glu Ala Thr Gly Pro Ala Ala Trp Val Leu Ser Arg Gin Val 
165 170 175 180 

gat gag ggc ggc gtg age gtt gtc atg etc aag cga ctg tec gac gtg 631 
Asp Glu Gly Gly Val Ser Val Val Met Leu Lys Arg Leu Ser Asp Val 
185 190 195 

ggc acg gcg ggc att gag gca acg tec gtt cgc cgc ttc ctg gcg gat 679 
Gly Thr Ala Gly lie Glu Ala Thr Ser Val Arg Arg Phe Leu Ala Asp 
200 205 210 

aag tgg gcg gtg etc caa gtg caa gtg agg aac ggc tct ggg cag cct 727 
Lys Trp Ala Val Leu Gin Val Gin Val Arg Asn Gly Ser Gly Gin Pro 
215 220 225 

tgg egg cca ggg egg gca tgg ctg gaa agt gcg gcc acc agg cgc etc 775 
Trp Arg Pro Gly Arg Ala Trp Leu Glu Ser Ala Ala Thr Arg Arg Leu 
230 235 240 

acg gaa gcg cgc acg gtg age atg gcg ccg gaa gtg ctg gcg ccc gaa 
Thr Glu Ala Arg Thr Val Ser Met Ala Pro Glu Val Leu Ala Pro Glu 
245 250 255 



260 



~ac att gcc aag gtg ctg ggc tgg aca cgc act gac gag ggt tac ggc 
Aan Ixe Ala Lys Val Leu Gly Trp Thr Arg Thr Asp Glu Gly Tyr Gly 

395 400 



823 



ggt aag age cgc gtg gtg gtg gag ttt gcc ggg aag ccc ggg age aag 871 
Gly Lys Ser Arg Val Val Val Glu Phe Ala Gly Lys Pro Gly Ser Lys 
265 270 275 

ggg gaa gtc ttc cga ttg gtg gta gag gac gca gac ggc gtc egg ccc 919 
Gly Glu Val Phe Arg Leu Val Val Glu Asp Ala Asp Gly Val Arg Pro 
280 285 290 

etc tea gtc gcg ggc gtg gtg atg aag gac aca gcg cag gag aag age 967 
Leu Ser Val Ala Gly Val Val Met Lys Asp Thr Ala Gin Glu Lys Ser 
295 300 305 

ggc cc atg atg acc gtg gat ccc ccc aac ttc etc aag ccc ggc gcc 1014 
Gly Met Met Thr Val Asp Pro Pro Asn Phe Leu Lys Pro Gly Ala 
310 315 320 

cgc etc ggg gcc ccc tac egg att cgc ggt tgg aag ggc ggc ggt gcc 1062 
Arg Leu Gly Ala Pro Tyr Arg lie Arg Gly Trp Lys Gly Gly Gly Ala 
325 330 335 

tat ggc gag gtg tac gag tgc gaa agg cag ggc agg ccc tac gcc etc 1110 
Tyr Gly Glu Val Tyr Glu Cys Glu Arg Gin Gly Arg Pro Tyr Ala Leu 
340 345 350 355 

aag ctg tec aag cac cgc cag tec age gat gac ccg ggc aag acg gac 1158 
Lys Leu Ser Lys His Arg Gin Ser Ser Asp Asp Pro Gly Lys Thr Asp 
360 365 370 

caa egg ctg ctg egg gag ttg gtg tgc etc gtc caa ctg gac cac ccg 1206 
Glr. Arg Leu ^ Qlu Leu Val Cys ^ ^ ^ ^ ^ ^ ^ 

375 380 385 



1254 
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tat ctg gtg ctg gag tac ate gag ggg tgg acg ctg gec caa tgg etc 
Tyr Leu Val Leu Glu Tyr lie Glu Gly Trp Thr Leu Ala Gin Trp Leu 
405 410 415 



1302 



cag caa acg cgc ccc acc gtc gec caa gtc ctg cac gtc ttc gec aaa 1350 
Gin Gin Thr Arg Pro Thr Val Ala Gin Val Leu His Val Phe Ala Lys 
420 425 430 435 

ctg acg ggg gcg etc gcg tac atg cac cgc agg ggc gtt egg cac egg 1398 
Leu Thr Gly Ala Leu Ala Tyr Met His Arg Arg Gly Val Arg His Arg 
440 445 450 

gac gtg teg ctg tct aac gtc atg gta cgc aag gcg gac ggc gag ccc 1446 
Asp Val Ser Leu Ser Asn Val Met Val Arg Lys Ala Asp Gly Glu Pro 
455 460 465 

gtc etc att gac etc ggg gcg ggg gac tac tec ggg gcg cat gag ttg 1494 
Val Leu lie Asp Leu Gly Ala Gly Asp Tyr Ser Gly Ala His Glu Leu 
470 475 480 

acg gac tgg ccc ttg ccc ccg ggc acc gac cgc tac cgc tck ccc gar 1542 
Thr Asp Trp Pro Leu Pro Pro Gly Thr Asp Arg Tyr Arg Xaa Pro Glu 
485 490 495 

gcg gec cgc ttc etc aag gya cac cag aac gac ccg age gec egg tac 1590 
Ala Ala Arg Phe Leu Lys Xaa His Gin Asn Asp Pro Ser Ala Arg Tyr 
500 505 510 515 

gac ttc ccc ccg gaa gat gac ttc tac teg ctg gcg gtg tgc etc tat 1638 
Asp Phe Pro Pro Glu Asp Asp Phe Tyr Ser Leu Ala Val Cys Leu Tyr 
520 525 530 

gac gcg ctg acg gac gcg gag ccc gee etc aag gcg aac gca cgc aag 1686 
Asp Ala Leu Thr Asp Ala Glu Pro Ala Leu Lys Ala Asn Ala Arg Lys 
53 5 540 545 

gee ccg cgc etc aac gtc aac age ccc acc atg gee ccg ccc ccg gee 1734 
Ala Pro Arg Leu Asn Val Asn Ser Pro Thr Met Ala Pro Pro Pro Ala 
550 555 560 

cgc aag gtc aac ccg cgc gtg ccc gag gta gtg agt gee tgg gta gag 1782 
Arg Lys Val Asn Pro Arg Val Pro Glu Val Val Ser Ala Trp Val Glu 
565 570 575 

egg tgg ctg gtg cgt gac gtg gaa acc egg egg ccc gcg ttg gee gee 1830 
Arg Trp Leu Val Arg Asp Val Glu Thr Arg Arg Pro Ala Leu Ala Ala 
580 585 590 595 

atg ccc ggc gcg ctg gaa gag ttg gcg aag caa ggc ggc gcg gag tgg 187 8 

Met Pro Gly Ala Leu Glu Glu Leu Ala Lys Gin Gly Gly Ala Glu Trp 
600 605 610 

ctg gee acc gtt cag ccc cca ccg gaa gcg ggc acc ccg gee ccc gag 192 6 

Leu Ala Thr Val Gin Pro Pro Pro Glu Ala Gly Thr Pro Ala Pro Glu 
615 620 625 

cac egg etc tac egg aag egg egg cgc gtg ctg gcg tgg gec atg gec 1974 
His Arg Leu Tyr Arg Lys Arg Arg Arg Val Leu Ala Trp Ala Met Ala 
630 635 640 

get ggc gtg etc gtg get gtc gcg gcg etc gcg tgg ctg cgc get gec 2022 
Ala Gly Val Leu Val Ala Val Ala Ala Leu Ala Trp Leu Arg Ala Ala 
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64 5 650 655 

ccc tct cca gac gcg ccg ccg tec cag gtc gta ccg gag gca cca age 2070 
Pro Ser Pro Asp Ala Pro Pro Ser Gin Val Val Pro Glu Ala Pro Ser 
660 665 670 675 

acg ccg cag gcg cca gee ccg gac agt cca cca acc ccg ccc gtg ccg 2118 
Thr Pro Gin Ala Pro Ala Pro Asp Ser Pro Pro Thr Pro Pro Val Pro 
680 685 690 

tct teg ggt gca gta ccg gtc ccc ate cca ggt gtc cct gtt gag aag 2166 
Ser Ser Gly Ala Val Pro Val Pro He Pro Gly Val Pro Val Glu Lys 
695 700 705 



gaa age cct ctg tga gtacccccac gaantcegg 
Glu Ser Pro Leu 
710 

<210> 37 

<211> 309 

<212> PRT 

<213> Angiococcus disciformis 
<220> 

<221> misc_f eature 

<222> (2195) . . (2195) 

<223> n=A, C, G or T 

<400> 37 

Met Arg Leu Arg Pro Met Pro His Val Pro Gly Leu Leu Leu Val Leu 
1 5 10 15 

Leu Ser Met Ala Ala Trp Ala Ala Glu Ala Gin Thr Ala Pro Gly Pro 
20 25 30 

Ser Leu Arg Phe Arg Arg Val Pro Leu Thr Gly Lys Pro Glu Val Val 
35 40 45 

Arg Leu Ala Thr Gly He Pro Ala Thr Leu Asn Phe Asp Ala Glu He 
50 55 60 

Asn Pro Gin Lys Val Arg Leu Asp Ala Pro Ala Arg Val Lys He Leu 
65 70 75 80 



2200 



Ser Thr Gly Glu Arg Ser Val Thr Leu Leu Ala Glu Gly Asp Leu Gly 
85 90 95 
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Glu Gly Val Thr Leu Arg Val Pro Phe Leu Thr Glu Pro Ser Leu Ala 

100 10 5 no 

Glu Pro Ala Phe Gin Leu Val Thr Ala Ser Asp Val Ala Asp Ser Gin 
115 120 125 

Val Met Val Tyr Arg Ser Ala Asn Ala Pro Glu Leu Met Gin Ala Arg 
13 0 135 140 

Leu Thr Glu Leu Glu Ala Arg Ser Ala Ala Cys Glu Ala Gin Leu Ser 
145 15 0 155 160 

Thr Glu Arg Glu Arg Ser Glu Ala Thr Gly Pro Ala Ala Trp Val Leu 
165 170 175 

Ser Arg Gin Val Asp Glu Gly Gly Val Ser Val Val Met Leu Lys Arg 
180 185 190 

Leu Ser Asp Val Gly Thr Ala Gly lie Glu. Ala Thr Ser Val Arg Arg 
195 200 205 

Phe Leu Ala Asp Lys Trp Ala Val Leu Gin Val Gin Val Arg Asn Gly 
210 215 220 

Ser Gly Gin Pro Trp Arg Pro Gly Arg Ala Trp Leu Glu Ser Ala Ala 
225 230 235 240 

Thr Arg Arg Leu Thr Glu Ala Arg Thr Val Ser Met Ala Pro Glu Val 
245 250 255 

Leu Ala Pro Glu Gly Lys Ser Arg Val Val Val Glu Phe Ala Gly Lys 
260 265 270 

Pro Gly Ser Lys Gly Glu Val Phe Arg Leu Val Val Glu Asp Ala Asp 
275 280 285 

Gly on. ^ g Pr ° LSU Ser Val Ala Val Val Met Lys Asp Thr Ala 

290 295 300 

Gin Glu Lys Ser Gly 
305 



<210> 
<211> 
<212> 



38 

402 

PRT 
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<213> Angiococcus disciformis 



<220> 

<221> misc_f eature 
<222> (188).. (188) 

<223> The 'Xaa' at location 188 stands for Ser. 
<220> 

<221> misc_f eature 
<222> (197) . . (197) 

<223> The 'Xaa' at location 197 stands for Ala, or Val. 
<220> 

<221> misc_feature 
<222> (2195) . . (2195) 
<223> n=A, C, G or T 
<400> 38 

Met Met Thr Val Asp Pro Pro Asn Phe Leu Lys Pro Gly Ala Arg Leu 
1 5 10 15 

Gly Ala Pro Tyr Arg lie Arg Gly Trp Lys Gly Gly Gly Ala Tyr Gly 
20 25 30 

Glu Val Tyr Glu Cys Glu Arg Gin Gly Arg Pro Tyr Ala Leu Lys Leu 
35 40 45 

Ser Lys His Arg Gin Ser Ser Asp Asp Pro Gly Lys Thr Asp Gin Arg 
* v 55 60 

Leu Leu Arg Glu Leu Val Cys Leu Val Gin Leu Asp His Pro Asn lie 
65 70 75 80 

Ala Lys Val Leu Gly Trp Thr Arg Thr Asp Glu Gly Tyr Gly Tyr Leu 
85 90 ~ 95 

Val Leu Glu Tyr He Glu Gly Trp Thr Leu Ala Gin Trp Leu Gin Gin 
100 105 110 

•W,r Arg Pro Thr Val Ala Gin Val Leu His Val Phe Ala Lys Leu Thr 

120 125 

.-.I* Leu Ala Tyr Met His Arg Arg Gly Val Arg His Arg Asp Val 
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Ser Leu Ser Asn Val Met Val Arg Lys Ala Asp Gly Glu Pro Val Leu 
145 150 155 160 

lie Asp Leu Gly Ala Gly Asp Tyr Ser Gly Ala His Glu Leu Thr Asp 
165 170 175 

Trp Pro Leu Pro Pro Gly Thr Asp Arg Tyr Arg Xaa Pro Glu Ala Ala 
180 185 19Q 

Arg Phe Leu Lys Xaa His Gin Asn Asp Pro Ser Ala Arg Tyr Asp Phe 
195 200 205 

Pr ° of« G1U ASP ASP Phe ^ Ser Leu Ala Val CVS Leu Tyr Asp Ala 
210 215 220 

Leu Thr Asp Ala Glu Pro Ala Leu Lys Ala Asn Ala Arg Lys Ala Pro 
225 230 235 240 

Arg Leu Asn Val Asn Ser Pro Thr Met Ala Pro Pro Pro Ala Arg Lys 
245 250 255 

Val Asn Pro Arg Val Pro Glu Val Val Ser Ala Trp Val Glu Arg Trp 
260 265 270 

Leu Val Arg Asp Val Glu Thr Arg Arg Pro Ala Leu Ala Ala Met Pro 
275 280 285 

GlY S G1U G1U LeU Ala LyS Gln G± y Ala Glu Trp Leu Ala 

290 295 300 

Thr Val Gin Pro Pro Pro Glu Ala Gly Thr Pro Ala Pro Glu His Arg 
305 31 <> 315 320 

Leu Tyr Arg Lys Arg Arg Arg Val Leu Ala Trp Ala Met Ala Ala Gly 
325 330 335 

Val Leu Val Ala Val Ala Ala Leu Ala Trp Leu Arg Ala Ala Pro Ser 
340 345 ~ 350 

Pro Asp Ala Pro Pro Ser Gin Val Val Pro Glu Ala Pro Ser Thr Pro 
355 360 365 

Gin Ala Pro Ala Pro Asp Ser Pro Pro Thr Pro Pro Val Pro Ser Ser 
370 375 380 
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W 274 W 

Gly Ala Val Pro Val Pro lie Pro Gly Val Pro Val Glu Lys Glu Ser 
385 390 395 400 

Pro Leu 

<210> 39 

<211> 21 

<212> DNA 

<213> artificial sequence 
<220> 

<223> K-388 primer 

<400> 39 

tgggaatcat ttgaaggttg g 

<210> 40 

<211> 22 

<212> DNA 

<213> artificial sequence 
<220> 

<223> K-389 primer 

<400> 40 

tgtgtttttc tttgttagac eg 

<210> 41 

<211> 21 

<212> DNA 

<213> artificial sequence 
<220> 

<223> ASTIslA primer 

< 4 0 0 > 41 

cacccggacc tgcctggatt c 21 
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<210> 42 

<211> 21 

<212> DNA 

<213> artificial sequence 
<220> 

<223> ASTIslB primer 

<400> 42 

tgctcggctg gcgctactca c 

<210> 43 

<211> 24 

<212> DNA 

<213> artificial sequence 
<220> 

<223> ASTIs2A primer 

<400> 43 

gctcccgggc cacgtggttg aaga 

<210> 44 

<211> 23 

<212> DNA 

<213> artificial sequence 
<220> 

<223> ASTIs2B primer 

<400> 44 

ccgcgggccg tggcagtggt gta 

<210> 45 

<211> 17 

<212> DNA 

<213> artificial sequence 
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<220> 



<223> TIs-up primer 



<400> 45 
tggcagccag cccgagc 



17 



<210> 46 

<211> 19 

<212> DNA 

<213> artificial sequence 
<220> 

<223> TIs-down primer 

<400> 46 

ccgcgggtgc cctctcatc 19 



